I am using the CDH3U4 release versions of Hadoop, Hue, Hive, and Sqoop.
I have a three-node cluster. All nodes show up in the "live nodes" section.
In the Hue query editor, when I execute a query, the job always runs
successfully.
The only problem I have is that all map and reduce tasks run on a single node.
To be clear, one job runs everything (map and reduce tasks) on node1, and
after that the next job runs on either the same node1 or node2 (but once a
job has started, all of its map and reduce tasks run on a single node).
What I need to know is whether I can choose which nodes run the map and
reduce tasks, i.e., map tasks on one node and reduce tasks on another node.
Can anyone give an explanation or a solution to my problem?
Thanks in advance!