FAQ
Hi all,

I read the Hive tutorial, and it says that "no two aggregations can have
different DISTINCT columns". Could anyone tell me the reason? Will the
DISTINCT in the following query be translated into a map-reduce job, or is
it done locally?

INSERT OVERWRITE TABLE pv_gender_agg
SELECT pv_users.gender, count(DISTINCT pv_users.userid),
count(DISTINCT pv_users.ip)
FROM pv_users
GROUP BY pv_users.gender;
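
For reference, here is a rough sketch of how the same result might be
obtained while keeping only one DISTINCT column per query: each count is
computed in its own subquery and the two are joined on gender. The aliases
u, i, user_cnt and ip_cnt are made up for illustration, and this assumes
pv_gender_agg takes (gender, user count, ip count) in that order. Rows with
a NULL gender would be dropped by the join, so it is not an exact
equivalent in that case.

```sql
-- Sketch only: one DISTINCT column per subquery, joined on gender.
-- u, i, user_cnt and ip_cnt are illustrative names, not from the tutorial.
INSERT OVERWRITE TABLE pv_gender_agg
SELECT u.gender, u.user_cnt, i.ip_cnt
FROM (
  -- distinct users per gender
  SELECT gender, count(DISTINCT userid) AS user_cnt
  FROM pv_users
  GROUP BY gender
) u
JOIN (
  -- distinct IPs per gender
  SELECT gender, count(DISTINCT ip) AS ip_cnt
  FROM pv_users
  GROUP BY gender
) i
ON (u.gender = i.gender);
```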


--
Best Regards

Jeff Zhang

Discussion Overview
group: user @
categories: hive, hadoop
posted: Feb 25, '10 at 9:01a
active: Mar 30, '10 at 9:23a
posts: 10
users: 5
website: hive.apache.org
