Hi
I'm absolutely new with using Pig, only just picked it up like 3 days ago, and still trying to wrap my head around it. I'm stuck with putting together a query.
A DUMP of my sample dataset is as follows,
log = LOAD 'example-users.txt' AS (user:chararray, page:chararray);
DUMP log;
(User1,a)
(User1,b)
(User2,f)
(User3,b)
(User2,a)
(User1,e)
(User2,b)
(User2,c)
(User3,d)
(User1,d)
(User2,e)
(User2,a)
(User3,c)
(User1,d)
(User2,c)
(User3,a)
(User1,d)
(User2,b)
(User2,e)
(User3,c)
What I'm trying to do is to say, Users visiting page 'a' also visited this list of other pages ranked by number of times the page was visited. Can anyone help or give me some guidance?
Thanks
Leslie