I am doing an inner join on two relations say A, B.
A has fields - Word1:chararray, Word2:chararray, Word3:chararray,
B has fields - UniqueWord1:chararray, UniqueID:long
Facts about the relations:
- Relation B has no duplicates, no NULLs in either fields.
- Relation A has 840K records
- Relation B has 340 records
join_A_B = JOIN A BY Word1, B BY UniqueWord1;
I expected the join to have <= 840K records. However the join returns 860K
records. While I debug, I just thought of asking here. Any thoughts on what
could be wrong?