[HOD] If a cluster directory is deleted, hod -o list must show it, and deallocate should work.
----------------------------------------------------------------------------------------------

Key: HADOOP-2848
URL: https://issues.apache.org/jira/browse/HADOOP-2848
Project: Hadoop Core
Issue Type: Bug
Components: contrib/hod
Affects Versions: 0.16.0
Reporter: Hemanth Yamijala


Currently, if the cluster directory is deleted, all state about the cluster is lost. While this is not a problem in itself, basic recovery should still be possible: listing the Torque job id should work, and deallocation should correctly free up the nodes.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


  • Hemanth Yamijala (JIRA) at Mar 6, 2008 at 6:32 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Fix Version/s: 0.17.0
  • Hemanth Yamijala (JIRA) at Mar 17, 2008 at 4:47 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala reassigned HADOOP-2848:
    ----------------------------------------

    Assignee: Hemanth Yamijala
  • Hemanth Yamijala (JIRA) at Mar 18, 2008 at 6:31 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Attachment: 2848.patch

    The attached file modifies the behavior of the list, info and deallocate commands when the cluster directory is not found.

    list: reports the cluster state as 'cluster state unknown' and still prints the cluster id and cluster directory. Previously, it printed nothing.
    info: if the cluster directory was deleted, reports that the cluster directory is invalid and prints the cluster id.
    deallocate: if the cluster directory was deleted, reports that the cluster directory is invalid and deallocates the cluster based on its cluster id, thus freeing up the nodes.

    The patch also adds unit tests for testing the above conditions.
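
    The behavior described above can be illustrated with a minimal sketch. It is not taken from the patch: the registry dictionary, the function names, and the 'alive' state label are all hypothetical stand-ins, and only the 'cluster state unknown' wording comes from the comment. The point is that the job id is looked up from a record kept outside the cluster directory, so listing and deallocation still work after the directory is deleted.

```python
import os
import tempfile


def list_clusters(registry):
    """Return (job_id, cluster_dir, state) rows, one per recorded cluster.

    `registry` is a hypothetical stand-in for HOD's record of allocations:
    a dict mapping cluster directory -> Torque job id (cluster id).
    """
    rows = []
    for cluster_dir, job_id in sorted(registry.items()):
        # A missing directory no longer hides the entry; it is reported
        # with an explicit 'unknown' state instead. 'alive' is a made-up
        # label for the normal case.
        if os.path.isdir(cluster_dir):
            state = "alive"
        else:
            state = "cluster state unknown"
        rows.append((job_id, cluster_dir, state))
    return rows


def deallocate(registry, cluster_dir):
    """Drop the record for cluster_dir and return the job id to cancel."""
    if cluster_dir not in registry:
        raise KeyError("no cluster recorded for %s" % cluster_dir)
    # The job id comes from the registry, so this works even when the
    # directory itself is gone.
    return registry.pop(cluster_dir)


base = tempfile.mkdtemp()
kept = os.path.join(base, "kept")
gone = os.path.join(base, "gone")
os.mkdir(kept)
os.mkdir(gone)
registry = {kept: "101.torque", gone: "102.torque"}

os.rmdir(gone)  # simulate the user deleting the cluster directory
rows = list_clusters(registry)
freed = deallocate(registry, gone)
```

    In this toy model, the deleted directory still appears in `rows` with the state 'cluster state unknown', and `freed` holds the job id that a real implementation would hand to the resource manager to release the nodes.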
  • Hemanth Yamijala (JIRA) at Mar 18, 2008 at 6:31 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Status: Patch Available (was: Open)
  • Vinod Kumar Vavilapalli (JIRA) at Mar 18, 2008 at 12:21 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12579818#action_12579818 ]

    Vinod Kumar Vavilapalli commented on HADOOP-2848:
    -------------------------------------------------

    Some problems:
    * Allocate a cluster, delete the cluster directory, and then create the directory again. hod list again shows nothing. What should the behaviour be here?
    * Allocate a cluster, delete the cluster directory, create the directory again, and then use the same directory for another allocation. The second allocation goes through without fuss, and the first allocation keeps running without the user knowing it. Here, I guess, we definitely need some fix.

    Other than that, a very minor thing: deallocating such clusters says "Invalid cluster directory '<cluster-id>' specified. Deleted cluster with cluster id <cluster-id>". 'Deleted cluster' looked a bit confusing to me at first glance; could it be something like 'freed up' or 'removed from queue'?
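
    The second scenario can be reproduced in a toy model. This is only an illustration of why the case is dangerous, not HOD's actual code: the registry dictionary, `naive_allocate`, and the job ids are hypothetical. An allocate step that only checks that the directory exists cannot tell a recreated directory from a fresh one, so the earlier job id drops out of the records while the job itself keeps running.

```python
import os
import tempfile

# Hypothetical record of allocations: cluster directory -> Torque job id.
registry = {}


def naive_allocate(cluster_dir, job_id):
    """Allocate if the directory exists, without consulting earlier records.

    Returns the job id that was previously recorded for this directory,
    or None if there was none.
    """
    if not os.path.isdir(cluster_dir):
        raise ValueError("invalid cluster directory: %s" % cluster_dir)
    previous = registry.get(cluster_dir)
    registry[cluster_dir] = job_id  # silently replaces any earlier record
    return previous


cluster_dir = os.path.join(tempfile.mkdtemp(), "cluster")
os.mkdir(cluster_dir)
naive_allocate(cluster_dir, "201.torque")  # first allocation

os.rmdir(cluster_dir)  # the user deletes the directory...
os.mkdir(cluster_dir)  # ...and recreates it

# The second allocation goes through; the first job id is now orphaned.
orphaned = naive_allocate(cluster_dir, "202.torque")
```

    After the last call, `orphaned` holds the first job id, which no longer appears anywhere in the registry, so nothing would list or deallocate it.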
  • Hemanth Yamijala (JIRA) at Mar 18, 2008 at 5:45 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Status: Open (was: Patch Available)

    Cancelling patch to incorporate Vinod's comments.
  • Hemanth Yamijala (JIRA) at Mar 18, 2008 at 5:49 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Attachment: 2848.1.patch

    Vinod, thanks for the review.

    The attached patch fixes the problems identified in the review. In the process, I modified the error codes and messages for some of the operations so that they are now consistent.

    The list, info and deallocate commands now behave the same way for a directory that was previously allocated and was subsequently deleted or recreated: each identifies it as a previously allocated directory and prints a message accordingly.

    The allocate command will now fail if the cluster directory is found to be already allocated (or recreated).

    If an invalid directory is given to info or deallocate, the command just prints a message saying the directory is invalid and fails.
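
    A minimal sketch of the allocate check described above, under stated assumptions: the registry dictionary, the function, and the constant names are hypothetical, and the value 12 is simply the error code proposed later in this thread for this case. The check consults the allocation record rather than the directory on disk, so deleting and recreating the directory does not make it look free.

```python
import os
import tempfile

SUCCESS = 0
# Hypothetical constant name; 12 is the value suggested later in the thread.
ALREADY_ALLOCATED = 12


def allocate(registry, cluster_dir, job_id):
    """Record a new allocation and return an exit code.

    `registry` is a hypothetical dict of cluster directory -> job id.
    """
    if cluster_dir in registry:
        # The record, not the state of the directory on disk, decides
        # whether the directory is already in use.
        return ALREADY_ALLOCATED
    registry[cluster_dir] = job_id
    return SUCCESS


registry = {}
cluster_dir = os.path.join(tempfile.mkdtemp(), "cluster")
os.mkdir(cluster_dir)
first = allocate(registry, cluster_dir, "301.torque")

os.rmdir(cluster_dir)  # delete the cluster directory...
os.mkdir(cluster_dir)  # ...and recreate it
second = allocate(registry, cluster_dir, "302.torque")
```

    Here the first call succeeds, the second is refused, and the original job id stays on record so that it can still be listed and deallocated.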
  • Hemanth Yamijala (JIRA) at Mar 18, 2008 at 5:51 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Status: Patch Available (was: Open)
  • Vinod Kumar Vavilapalli (JIRA) at Mar 19, 2008 at 6:04 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580239#action_12580239 ]

    Vinod Kumar Vavilapalli commented on HADOOP-2848:
    -------------------------------------------------

    When you remove a cluster directory, create it again, and then use it for another allocation, the error code returned is 3. Error code 12 seems more appropriate.

    +1 for the fix otherwise.
  • Hemanth Yamijala (JIRA) at Mar 19, 2008 at 6:08 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Status: Open (was: Patch Available)
  • Hemanth Yamijala (JIRA) at Mar 19, 2008 at 6:10 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Attachment: 2848.2.patch

    Trivial change to incorporate Vinod's comment on the error code. This patch is committable once Hudson approves.
  • Hemanth Yamijala (JIRA) at Mar 19, 2008 at 6:10 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Hemanth Yamijala updated HADOOP-2848:
    -------------------------------------

    Status: Patch Available (was: Open)
  • Hadoop QA (JIRA) at Mar 19, 2008 at 6:30 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580489#action_12580489 ]

    Hadoop QA commented on HADOOP-2848:
    -----------------------------------

    +1 overall. Here are the results of testing the latest attachment
    http://issues.apache.org/jira/secure/attachment/12378200/2848.2.patch
    against trunk revision 619744.

    @author +1. The patch does not contain any @author tags.

    tests included +1. The patch appears to include 8 new or modified tests.

    javadoc +1. The javadoc tool did not generate any warning messages.

    javac +1. The applied patch does not generate any new javac compiler warnings.

    release audit +1. The applied patch does not generate any new release audit warnings.

    findbugs +1. The patch does not introduce any new Findbugs warnings.

    core tests +1. The patch passed core unit tests.

    contrib tests +1. The patch passed contrib unit tests.

    Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/1998/testReport/
    Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/1998/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
    Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/1998/artifact/trunk/build/test/checkstyle-errors.html
    Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/1998/console

    This message is automatically generated.
  • Devaraj Das (JIRA) at Mar 20, 2008 at 9:07 am
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Devaraj Das updated HADOOP-2848:
    --------------------------------

    Resolution: Fixed
    Status: Resolved (was: Patch Available)

    I just committed this. Thanks, Hemanth!
  • Hudson (JIRA) at Mar 20, 2008 at 1:17 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580755#action_12580755 ]

    Hudson commented on HADOOP-2848:
    --------------------------------

    Integrated in Hadoop-trunk #434 (See [http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/434/])
  • Hudson (JIRA) at Mar 21, 2008 at 12:21 pm
    [ https://issues.apache.org/jira/browse/HADOOP-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12581066#action_12581066 ]

    Hudson commented on HADOOP-2848:
    --------------------------------

    Integrated in Hadoop-trunk #435 (See [http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/435/])

Discussion Overview
group: common-dev @
categories: hadoop
posted: Feb 17, '08 at 10:13a
active: Mar 21, '08 at 12:21p
posts: 17
users: 1
website: hadoop.apache.org...
irc: #hadoop