FAQ
Try to restore failed replicas of Name Node storage (at checkpoint time)
------------------------------------------------------------------------

Key: HADOOP-4885
URL: https://issues.apache.org/jira/browse/HADOOP-4885
Project: Hadoop Core
Issue Type: Improvement
Components: dfs
Reporter: Boris Shkolnik




--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

  • Boris Shkolnik (JIRA) at Dec 16, 2008 at 7:35 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12657128#action_12657128 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    Problem:
    If one of the replicas of the NameNode storage fails for whatever reason (for example temporarily failure of NFS) this Storage object is removed from the list of storage objects forever. It can be added back only on restart of the NameNode. We propose to check the status of a failed storage on every checkpoint and if it becomes valid - try to restore the edits and fsimage.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Dec 22, 2008 at 6:41 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12658620#action_12658620 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    Current implementation:
    There is a list of StorageDir objects associated with each FSImage.
    Also there is a list of EditLogs with each FSImage. One edit log has a
    corresponding StorageDir (same directory). When an IO error happens a
    corresponding EditLog and StorageDir are removed from the corresponding
    lists.


    Suggested solution:
    When a StorageDir is removed - instead of discarding it we will put it into
    a separate list (removedDir list).
    Edit log is removed and discarded.
    When a secondary node starts a checkpoint it first "rolls" editLogs
    (rollEditLogs).This function verifies that there is no edits.new in any of
    the currently active directories and than create them.

    Before it actually creates edits.new, we can go over the list of all the
    removed dirs and check if they became writable. If so - we can put them back
    into the list. So edits.new will be created there. We also will create
    EditLogs object. And when later checkpoint "puts" (putFSImage) fsimage
    there - the directory will became active.


    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Dec 22, 2008 at 6:45 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12658621#action_12658621 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    Konstantin Shvachko commented:
    We should not place them into removeDir, but rather re-read configuration
    at this moment. This would also help if a new directory was configured
    as a new storage.

    If we go this route - we should probably only allow addition of the new (or restoration of the failed) directories. No removal of existing/working ones.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 5, 2009 at 11:30 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik reassigned HADOOP-4885:
    --------------------------------------

    Assignee: Boris Shkolnik
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 22, 2009 at 12:46 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12666013#action_12666013 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    Using configuration to figure out new or failed storage replicas may have some undesirable side effects. We do not keep the original configuration around. So the only option for us will be to reload a "default" configuration. This "default" configuration may or may not be the same as the one used when the cluster was started. One example of such a case would be any test that creates configuration on the flight by populating some of its values in the setUP or init functions. In this case first checkpoint will try to use all the settings from the default configuration instead of fixing the failed ones.

    Taking this into account I suggest we go back to the original propose, namely - remembering failed replicas, and trying to restore them on every checkpoint.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 22, 2009 at 1:42 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12666032#action_12666032 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    Another issue is testing. To test this feature we would need to simulate failure to write into edit logs. For manual testing I used mounting.
    But that may be non-portable to all the systems, so I need another solution.
    My suggestions is to create a "mock" of the stream class. It will extend from EditLogFileOutputStream and override write() methods.
    It will also introduce a boolean member flag to specify if it is in a "failure" mode. When the flag is set - the write methods will throw IOException, if not they will call corresponding functions from the super classes.

    Class will look something like this:
    ------------
    class EditLogFileErrorTestOutputStream extends EditLogFileOutputStream {
    private boolean throwException = false;
    public void setThrowException(boolean val){
    throwException = val;
    }

    public void write(int b) throws IOException {
    if(throwException)
    throw new IOException("Simulated IOException in write()");
    else
    super.write(b);
    }
    }
    --------------

    configuration will have a setting "dfs.name.dir.simulateError" to specify that it needs editLog streams of the EditLogFileErrorTestOutputStream class.

    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 27, 2009 at 10:17 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12667512#action_12667512 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    other suggestions for simulating storage failures in the test:
    1. Using Security Manager
    unfortunately security manager is consulted during opening of a file for writing, but not for a writing into already open stream.

    2. deleting directories and calling doCheckpoint to force creation of a new edits file in these directories (which will fail and cause the storage to be moved to removedStorages).
    This looks promising. I am going to investigate this approach.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 29, 2009 at 1:37 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Attachment: HADOOP-4885.patch
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 29, 2009 at 7:02 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Attachment: HADOOP-4885.patch
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 29, 2009 at 7:02 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668552#action_12668552 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    running patch-test on trunk

    ANT_HOME=/home/hadoopqa/tools/ant/apache-ant-1.7.1 ant -Dpatch.file=../patches/HADOOP-4885.patch -Dfindbugs.home=/home/ndaley/tools/findbugs/latest -Dforrest.home=/home/ndaley/tools/forrest/latest -Djava5.home=/usr/releng/tools/java/jdk1.5.0_06 -Dscratch.dir=../scratch_dir/ test-patch


    +1 overall.
    [exec]
    [exec] +1 @author. The patch does not contain any @author tags.
    [exec]
    [exec] +1 tests included. The patch appears to include 3 new or modified tests.
    [exec]
    [exec] +1 javadoc. The javadoc tool did not generate any warning messages.
    [exec]
    [exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings.
    [exec]
    [exec] +1 findbugs. The patch does not introduce any new Findbugs warnings.
    [exec]
    [exec] +1 Eclipse classpath. The patch retains Eclipse classpath integrity.
    [exec]

    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Jan 30, 2009 at 1:10 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Status: Patch Available (was: Open)
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Hadoop QA (JIRA) at Jan 31, 2009 at 7:38 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669218#action_12669218 ]

    Hadoop QA commented on HADOOP-4885:
    -----------------------------------

    -1 overall. Here are the results of testing the latest attachment
    http://issues.apache.org/jira/secure/attachment/12399026/HADOOP-4885.patch
    against trunk revision 739416.

    +1 @author. The patch does not contain any @author tags.

    +1 tests included. The patch appears to include 3 new or modified tests.

    +1 javadoc. The javadoc tool did not generate any warning messages.

    +1 javac. The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs. The patch does not introduce any new Findbugs warnings.

    +1 Eclipse classpath. The patch retains Eclipse classpath integrity.

    +1 release audit. The applied patch does not increase the total number of release audit warnings.

    +1 core tests. The patch passed core unit tests.

    -1 contrib tests. The patch failed contrib unit tests.

    Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3781/testReport/
    Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3781/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
    Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3781/artifact/trunk/build/test/checkstyle-errors.html
    Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3781/console

    This message is automatically generated.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Konstantin Shvachko (JIRA) at Feb 3, 2009 at 1:30 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12669836#action_12669836 ]

    Konstantin Shvachko commented on HADOOP-4885:
    ---------------------------------------------

    # {{storageList()}} should be called {{listStorageDirectories()}}, it should be a member of the {{Storage}} class, and then you can use simple loop like
    {code}
    for (StorageDirectory sd : storageDirs) {
    {code}
    Also it would be better to use {{sd}} instead of {{st}}.
    # There are some trivial changes like empty line and adding { } in a one line if statement. It would be better to remove those.
    # Long lines in {{tryToRestoreRemovedStorage()}} should be split.
    # In line
    {code}
    if(root.exists() && root.canWrite() && FileUtil.fullyDelete(root) && root.mkdir()) {
    format(sd);
    {code}
    You do all operations twice because {{format()}} does {{fullyDelete}} and {{mkDirs}} inside.
    # In {{FSEditLog.logEdit()}} you should use {{warn(Object arg, Throwable arg);}} rather than printing {{ie.getLocalizedMessage()}}
    # If possible please keep line length within the 80 margin acording to hadoop coding style.
    # "failed" vs "faild" in {{getEditLogSize()}}
    # In {{rollEditLog()}}
    ## {{// check if any of failed storage is now *available*}} and put it back
    ## You replaced {{processIOError(sd);}} by simply removing edit streams from the list of streams. That does not seem to be right. The name-node should fail if the last stream is being removed.

    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Feb 5, 2009 at 8:06 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Attachment: HADOOP-4885-1.patch

    implemented comments
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Konstantin Shvachko (JIRA) at Feb 9, 2009 at 7:49 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Konstantin Shvachko updated HADOOP-4885:
    ----------------------------------------

    Status: Open (was: Patch Available)
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Konstantin Shvachko (JIRA) at Feb 9, 2009 at 7:53 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Konstantin Shvachko updated HADOOP-4885:
    ----------------------------------------

    Status: Patch Available (was: Open)

    +1 This looks good.
    Need test results. Resubmitting.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Hadoop QA (JIRA) at Feb 9, 2009 at 9:16 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12672012#action_12672012 ]

    Hadoop QA commented on HADOOP-4885:
    -----------------------------------

    -1 overall. Here are the results of testing the latest attachment
    http://issues.apache.org/jira/secure/attachment/12399585/HADOOP-4885-1.patch
    against trunk revision 742698.

    +1 @author. The patch does not contain any @author tags.

    +1 tests included. The patch appears to include 3 new or modified tests.

    +1 javadoc. The javadoc tool did not generate any warning messages.

    +1 javac. The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs. The patch does not introduce any new Findbugs warnings.

    +1 Eclipse classpath. The patch retains Eclipse classpath integrity.

    +1 release audit. The applied patch does not increase the total number of release audit warnings.

    -1 core tests. The patch failed core unit tests.

    -1 contrib tests. The patch failed contrib unit tests.

    Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3820/testReport/
    Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3820/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
    Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3820/artifact/trunk/build/test/checkstyle-errors.html
    Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3820/console

    This message is automatically generated.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Feb 9, 2009 at 11:32 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12672051#action_12672051 ]

    Boris Shkolnik commented on HADOOP-4885:
    ----------------------------------------

    failed test is reported in HADOOP-5172 to fail regularly on Hudson builds.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Feb 13, 2009 at 10:19 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Attachment: HADOOP-4885-3.patch
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885-3.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Boris Shkolnik (JIRA) at Feb 14, 2009 at 12:05 am
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Boris Shkolnik updated HADOOP-4885:
    -----------------------------------

    Attachment: HADOOP-4885-3.patch

    added license information to the new file.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Attachments: HADOOP-4885-1.patch, HADOOP-4885-3.patch, HADOOP-4885-3.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Konstantin Shvachko (JIRA) at Feb 17, 2009 at 6:33 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Konstantin Shvachko updated HADOOP-4885:
    ----------------------------------------

    Resolution: Fixed
    Fix Version/s: 0.21.0
    Hadoop Flags: [Reviewed]
    Status: Resolved (was: Patch Available)

    I just committed this. Thank you Boris.
    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Fix For: 0.21.0

    Attachments: HADOOP-4885-1.patch, HADOOP-4885-3.patch, HADOOP-4885-3.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Hudson (JIRA) at Feb 18, 2009 at 2:53 pm
    [ https://issues.apache.org/jira/browse/HADOOP-4885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674640#action_12674640 ]

    Hudson commented on HADOOP-4885:
    --------------------------------

    Integrated in Hadoop-trunk #758 (See [http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/758/])
    . Committing additional file TestStorageRestore.java.
    . Try to restore failed name-node storage directories at checkpoint time. Contributed by Boris Shkolnik.

    Try to restore failed replicas of Name Node storage (at checkpoint time)
    ------------------------------------------------------------------------

    Key: HADOOP-4885
    URL: https://issues.apache.org/jira/browse/HADOOP-4885
    Project: Hadoop Core
    Issue Type: Improvement
    Components: dfs
    Reporter: Boris Shkolnik
    Assignee: Boris Shkolnik
    Fix For: 0.21.0

    Attachments: HADOOP-4885-1.patch, HADOOP-4885-3.patch, HADOOP-4885-3.patch, HADOOP-4885.patch, HADOOP-4885.patch

    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-dev @
categorieshadoop
postedDec 16, '08 at 7:29p
activeFeb 18, '09 at 2:53p
posts23
users1
websitehadoop.apache.org...
irc#hadoop

1 user in discussion

Hudson (JIRA): 23 posts

People

Translate

site design / logo © 2023 Grokbase