FAQ
Hi,
Can you please provide pointers (your experience plus location in the code)
to look for:
a) For a given job: how the data is provided to a specific task-tracker for
the mr computation. (consider the non-practical scenario where a data node
is NOT a task node and vice-versa). Then the data shd be copied over to the
tt. I think so. Who does that? NN has an in-memory map for chunks to
location; is it like JT ask a TT to go to a specific location after
consulting with NN (based on availability/load). Where in the code?

As per TW's book, there is some communication between namenode and
job-tracker to decide which replicated chunk shd be dealt where etc. And
then TT copies the data from the DT. What is the strategy for deciding this
assignment. Where in the code?

Thanks,
Himanshu

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-dev @
categorieshadoop
postedAug 29, '10 at 6:03a
activeAug 29, '10 at 6:03a
posts1
users1
websitehadoop.apache.org...
irc#hadoop

1 user in discussion

Himanshu Vashishtha: 1 post

People

Translate

site design / logo © 2022 Grokbase