|| at Aug 27, 2010 at 1:11 am
I will be surprised to know if anyone is using Pig on a standalone basis in production environment . Pig uses Hadoop's power to process huge data.
HDFS for storing large data and MapReduce for using distributed system for processing huge data.
You can use Pig in standalone mode for learning purpose but using it on a standalone basis in production environment won't be useful.
On 8/26/10 5:38 PM, "Somdip" wrote:
Just wanted to know has anyone used Pig on a standalone basis in production environment i.e. without integrating with Haddoop? Is that even a good idea?
How is the performance if we try parsing big files using Pig. Typically we use perl for this but often we run into memory management issues and Pig seems
to be much simpler than perl.
Please share if you have any inputs.