This may have already been asked, but I couldn't find anything in old mails
... I did find an old bug report PIG-66 about this but it got closed with no
pointer to what the outcome was.
My question:
Is there any way to get a handle on the HDFS from inside a Pig UDF (in
particular, a StoreFunc)?
(Alternatively, if I can get the hadoop JobConf that would allow me to get
the HDFS by calling FileSystem.get(conf).)
My use case is:
I'm building a StoreFunc that creates a Lucene index, following the rubric
from the hadoop.contrib.index code, in which you first have Lucene create
index files in the local FS, and then copy them to the HDFS.
Thanks!
-Chris
--
Christopher Olston, Ph.D.
Sr. Research Scientist
Yahoo! Research