FAQ
Hi,

Currently i am trying to join 2 tables. but see unexpected results from
join. Below are the results.
========================================================
[localhost:21000] > select count(*) from demographics;
6132409
[localhost:21000] > select count(*) from useractivity;
33282073
[localhost:21000] > select count(*) from demographics join useractivity on
(demographics.seqid = useractivity.seqid);
74322301
[localhost:21000] >
=========================================================

Is the result correct? since i was making a inner join i expected count(*)
in the join to be max of first 2 queries(i.e 33282073) , but instead
getting this high number(74322301). Is my understanding correct?

Unfortunately there is some problem running hive query on the setup and
hence couldn't cross verify with Hive.

any pointers to solve this would be helpful.

Thanks and Regards
Suresh

--

Search Discussions

Discussion Posts

Follow ups

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 1 of 2 | next ›
Discussion Overview
groupimpala-user @
categorieshadoop
postedJan 24, '13 at 12:50p
activeJan 31, '13 at 11:45a
posts2
users1
websitecloudera.com
irc#hadoop

1 user in discussion

Suresh K: 2 posts

People

Translate

site design / logo © 2023 Grokbase