[
https://issues.apache.org/jira/browse/HADOOP-4967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12660063#action_12660063 ]
Amareshwari Sriramadasu commented on HADOOP-4967:
-------------------------------------------------
The logs for the above scenario:
2008-12-31 06:12:11,255 INFO org.apache.hadoop.mapred.JvmManager: In JvmRunner constructed JVM ID: jvm_200812310557_0001_m_-700127208
2008-12-31 06:12:11,266 INFO org.apache.hadoop.mapred.TaskTracker: Trying to launch : attempt_200812310557_0001_m_001155_0
2008-12-31 06:12:11,286 INFO org.apache.hadoop.mapred.JvmManager: JVM Runner jvm_200812310557_0001_m_-700127208 spawned.
2008-12-31 06:12:11,305 INFO org.apache.hadoop.mapred.TaskTracker: In TaskLauncher, current free slots : 1 and trying to launch attempt_200812310557_0001_m_001155_0
2008-12-31 06:12:20,678 INFO org.apache.hadoop.mapred.TaskTracker: Received 'KillJobAction' for job: job_200812310557_0001
2008-12-31 06:12:20,678 INFO org.apache.hadoop.mapred.TaskRunner: attempt_200812310557_0001_m_000317_0 done; removing files.
2008-12-31 06:12:20,681 INFO org.apache.hadoop.mapred.IndexCache: Map ID attempt_200812310557_0001_m_000317_0 not found in cache
2008-12-31 06:12:20,682 INFO org.apache.hadoop.mapred.TaskTracker: addFreeSlot : current free slots : 1
2008-12-31 06:12:20,682 INFO org.apache.hadoop.mapred.TaskRunner: attempt_200812310557_0001_m_001040_0 done; removing files.
2008-12-31 06:12:20,685 INFO org.apache.hadoop.mapred.IndexCache: Map ID attempt_200812310557_0001_m_001040_0 not found in cache
2008-12-31 06:12:20,685 INFO org.apache.hadoop.mapred.TaskTracker: addFreeSlot : current free slots : 2
2008-12-31 06:12:20,686 INFO org.apache.hadoop.mapred.TaskRunner: attempt_200812310557_0001_m_001155_0 done; removing files.
2008-12-31 06:12:20,688 INFO org.apache.hadoop.mapred.IndexCache: Map ID attempt_200812310557_0001_m_001155_0 not found in cache
2008-12-31 06:12:20,688 INFO org.apache.hadoop.mapred.TaskRunner: attempt_200812310557_0001_m_000136_0 done; removing files.
2008-12-31 06:12:20,691 INFO org.apache.hadoop.mapred.IndexCache: Map ID attempt_200812310557_0001_m_000136_0 not found in cache
2008-12-31 06:12:20,692 WARN org.apache.hadoop.mapred.TaskTracker: Unknown child task finshed: attempt_200812310557_0001_m_001040_0. Ignored.
2008-12-31 06:12:20,693 INFO org.apache.hadoop.mapred.JvmManager: In JvmRunner constructed JVM ID: jvm_200812310557_0001_m_-1030395660
2008-12-31 06:12:20,693 INFO org.apache.hadoop.mapred.JvmManager: JVM Runner jvm_200812310557_0001_m_-1030395660 spawned.
2008-12-31 06:12:20,694 WARN org.apache.hadoop.mapred.TaskTracker: Unknown child task finshed: attempt_200812310557_0001_m_001155_0. Ignored.
2008-12-31 06:12:28,532 INFO org.apache.hadoop.mapred.TaskTracker: LaunchTaskAction (registerTask): attempt_200812310557_0002_m_000525_0 task's state:UNASSIGNED
2008-12-31 06:12:28,533 INFO org.apache.hadoop.mapred.TaskTracker: LaunchTaskAction (registerTask): attempt_200812310557_0002_m_000562_0 task's state:UNASSIGNED
2008-12-31 06:12:28,533 INFO org.apache.hadoop.mapred.TaskTracker: Trying to launch : attempt_200812310557_0002_m_000525_0
2008-12-31 06:12:28,533 INFO org.apache.hadoop.mapred.TaskTracker: In TaskLauncher, current free slots : 2 and trying to launch attempt_200812310557_0002_m_0
00525_0
2008-12-31 06:12:29,576 INFO org.apache.hadoop.mapred.JvmManager: JVM : jvm_200812310557_0001_m_-700127208 exited. Number of tasks it ran: 0
2008-12-31 06:12:29,790 INFO org.apache.hadoop.mapred.JvmManager: JVM : jvm_200812310557_0001_m_540025796 exited. Number of tasks it ran: 0
2008-12-31 06:12:48,271 INFO org.apache.hadoop.mapred.TaskTracker: Trying to launch : attempt_200812310557_0002_m_000562_0
2008-12-31 06:12:48,272 INFO org.apache.hadoop.mapred.TaskTracker: In TaskLauncher, current free slots : 1 and trying to launch attempt_200812310557_0002_m_000562_0
2008-12-31 06:12:48,290 INFO org.apache.hadoop.mapred.JvmManager: In JvmRunner constructed JVM ID: jvm_200812310557_0002_m_-612394729
2008-12-31 06:12:48,292 INFO org.apache.hadoop.mapred.JvmManager: JVM Runner jvm_200812310557_0002_m_-612394729 spawned.
2008-12-31 06:12:52,223 WARN org.apache.hadoop.mapred.TaskRunner: attempt_200812310557_0002_m_000562_0 Child Error
java.lang.RuntimeException: Inconsistent state!!! JVM Manager reached an unstable state while reaping a JVM for task: attempt_200812310557_0002_m_000562_0 Number of active JVMs:2
JVMId jvm_200812310557_0001_m_-1030395660 #Tasks ran: 0 Currently busy? true Currently running: attempt_200812310557_0001_m_001155_0
JVMId jvm_200812310557_0002_m_-612394729 #Tasks ran: 0 Currently busy? true Currently running: attempt_200812310557_0002_m_000525_0
at org.apache.hadoop.mapred.JvmManager$JvmManagerForType.reapJvm(JvmManager.java:253)
at org.apache.hadoop.mapred.JvmManager$JvmManagerForType.access$000(JvmManager.java:114)
at org.apache.hadoop.mapred.JvmManager.launchJvm(JvmManager.java:77)
at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:411)
2008-12-31 06:12:53,518 INFO org.apache.hadoop.mapred.JvmManager: JVM : jvm_200812310557_0001_m_-1030395660 exited. Number of tasks it ran: 0
Inconsistent state in JVM manager
---------------------------------
Key: HADOOP-4967
URL:
https://issues.apache.org/jira/browse/HADOOP-4967Project: Hadoop Core
Issue Type: Bug
Components: mapred
Reporter: Amareshwari Sriramadasu
The following is the scenario:
-> TT got a LaunchTaskAction. And It found the slot free. It started the TaskRunner
-> TT received KillJobActtion. It did a runner.kill().
But, runner.kill() didn't remove JVM Runner from jvmIdToRunner , as it is not polulated yet.
It released the slots.
-> TaskRunner thread spawned jvm and populated jvmIdToRunner.
-> The next task getting launched on the slot finds it as busy.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.