FAQ
This will get a compilation error.
The reason is that we use the sort phase in reducers to make sure we
can detect duplicate values.
We can only sort the table in one way than the other.

See https://issues.apache.org/jira/browse/HIVE-537 and
https://issues.apache.org/jira/browse/HIVE-474 for details.

Zheng
On Thu, Feb 25, 2010 at 1:01 AM, Jeff Zhang wrote:

Hi all,

I read the tutorial of Hive, and it says that "no two aggregations can have
different DISTINCT columns". Could anyone tell what is the reason ? Does the
following Distinct will been translate to map-reduce job or just do it
locally ?

INSERT OVERWRITE TABLE pv_gender_agg
SELECT pv_users.gender, count(DISTINCT pv_users.userid), count(DISTINCT
pv_users.ip)
FROM pv_users
GROUP BY pv_users.gender;

--
Best Regards

Jeff Zhang


--
Yours,
Zheng

Search Discussions

Discussion Posts

Previous

Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 2 of 10 | next ›
Discussion Overview
groupuser @
categorieshive, hadoop
postedFeb 25, '10 at 9:01a
activeMar 30, '10 at 9:23a
posts10
users5
websitehive.apache.org

People

Translate

site design / logo © 2021 Grokbase