I am trying to join two tables, but I see unexpected results from the
join. Below are the results.
[localhost:21000] > select count(*) from demographics;
[localhost:21000] > select count(*) from useractivity;
[localhost:21000] > select count(*) from demographics join useractivity on
(demographics.seqid = useractivity.seqid);
Is the result correct? Since I was doing an inner join, I expected the
count(*) of the join to be the max of the first two queries (i.e. 33282073),
but instead I get this much higher number (74322301). Is my understanding correct?
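
One possible explanation, offered only as an assumption since the data itself is not shown here, is that seqid is not unique in one or both tables. An inner join returns one row for every matching pair, so a seqid that appears a times in demographics and b times in useractivity contributes a * b rows, and the total can exceed either table's row count. A minimal sketch of queries that could check this, using only the table and column names from the queries above (the aliases d, u, n and expected_join_rows are made up for illustration):

```sql
-- List a few seqid values that occur more than once in each table.
SELECT seqid, COUNT(*) AS n
FROM demographics
GROUP BY seqid
HAVING COUNT(*) > 1
LIMIT 10;

SELECT seqid, COUNT(*) AS n
FROM useractivity
GROUP BY seqid
HAVING COUNT(*) > 1
LIMIT 10;

-- Row count an inner join on seqid should produce:
-- for each shared seqid, (occurrences in demographics) * (occurrences in useractivity).
SELECT SUM(d.n * u.n) AS expected_join_rows
FROM (SELECT seqid, COUNT(*) AS n FROM demographics GROUP BY seqid) d
JOIN (SELECT seqid, COUNT(*) AS n FROM useractivity GROUP BY seqid) u
ON (d.seqid = u.seqid);
```

If the first two queries return rows and expected_join_rows matches the join count, the high number would be explained by duplicate keys rather than by a wrong result.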
Unfortunately, there is a problem running Hive queries on this setup, so
I couldn't cross-verify with Hive.
Any pointers to solve this would be helpful.
Thanks and Regards