FAQ
Hello,
Does the "Data-local map tasks" counter mean the number of tasks that
the had the input data already present on the machine on they are
running on? i.e the wasn't a need to ship the data to them.
Thanks
Saptarsh

Saptarshi Guha | saptarshi.guha@gmail.com | http://www.stat.purdue.edu/~sguha

Search Discussions

  • Arun C Murthy at May 20, 2008 at 5:06 pm

    On May 20, 2008, at 9:03 AM, Saptarshi Guha wrote:

    Hello,
    Does the "Data-local map tasks" counter mean the number of tasks
    that the had the input data already present on the machine on they
    are running on? i.e the wasn't a need to ship the data to them.
    Yes. Your understanding is correct.

    More specifically it means that the map-task got scheduled on a
    machine on which one of the replicas of it's input-split-block was
    present and was served by the datanode running on that machine. *smile*

    Arun

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-user @
categorieshadoop
postedMay 20, '08 at 4:04p
activeMay 20, '08 at 5:06p
posts2
users2
websitehadoop.apache.org...
irc#hadoop

2 users in discussion

Saptarshi Guha: 1 post Arun C Murthy: 1 post

People

Translate

site design / logo © 2022 Grokbase