My requirement is to search an input string in a given file and output all
lines of the file that contains the string.
Am writing the following searchString.pig script( Have integrated hadoop
with pig. Hence, all my files are available at hdfs)
searchString.info : text file will have a single word 'sample' .
indexFile : text file will have 10 key-value pairs separated by
space, with word 'sample' appearing in 2 or 3 lines.
searchStr = load 'searchString.info' using PigStorage();
A = load 'indexFile' using PigStorage(' ') as (str,filename);
-- This string comparision works!
Y = FILTER A BY str eq 'abc';
--None of the below works
Y = FILTER A BY str eq searchStr ;
Y = FILTER A BY str eq searchStr.$0;
Instead of hardcoding the search string, I would like to search the string
available at searchStr dataatom.
1. Which way can i compare "A.str" with "searchStr" data atom?
2. Is there a way i can pass search string as an argument to the Pig Script?
Thankyou for your assistance.