Cos, Arun and Sridhar,

Thanks for the fast responses. Our two main requirements are:

1. Multiple output -- Write multiple output files in each reducer.

2. Max reducers per user -- Use the Fair Scheduler from 0.21 or later, which would let us limit the maximum number of concurrent reducers per pool/user.
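
To make the second requirement concrete, here is a rough sketch of a Fair Scheduler allocation file, assuming the per-pool maxReduces element described in the Fair Scheduler documentation for 0.21 and later. The pool name and the limit of 4 are made up for illustration. As far as I understand the defaults, the scheduler puts each user's jobs in a pool named after that user, so a pool entry named after a user should cap that user's concurrent reduce tasks.

```xml
<?xml version="1.0"?>
<!-- Sketch only: "some_user" and the value 4 are hypothetical. -->
<allocations>
  <!-- Assumes the default pool-per-user behaviour, so the pool name
       matches the submitting user's name. -->
  <pool name="some_user">
    <!-- Upper bound on reduce slots this pool may occupy at once. -->
    <maxReduces>4</maxReduces>
  </pool>
</allocations>
```

This would need to be checked against the scheduler documentation for whichever release we end up on before relying on it.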

~ Niranjan.
On Nov 22, 2011, at 2:19 PM, sridhar basam wrote:

On Tue, Nov 22, 2011 at 3:05 PM, Niranjan Balasubramanian <
niranjan@cs.washington.edu> wrote:

We are currently using Hadoop 0.20.203 on a 10 node cluster. We are
considering upgrading to a newer version and I have two questions in this
regard:

1) It seems 0.21 is unlikely to become a stable release anytime soon and
we are wary of moving to an unstable release. Our primary concern is the
data we have on our HDFS. Has anyone been using 0.21 in production? If so,
I would like to hear about your experiences. Any advice on this front is
appreciated.

What new features, or what deficiencies in the existing release, are you
trying to overcome? If your primary concern is stability, wouldn't it be
better to wait until 0.23 is GA'd? I don't think there are too many people
running 0.21 at scale.


2) Do we know when 0.23 is likely to become stable? There has been some
discussion on mail #dev* about 0.23 becoming stable sometime soon. Is it
going to happen by the end of this year?

~ Niranjan.

* -

Group: common-user @
Posted: Nov 22, '11 at 8:05p
Active: Nov 22, '11 at 10:37p