|
Romain Rigaux |
at Jul 8, 2011 at 11:39 pm
|
⇧ |
| |
Hi,
This double parsing is there because I did not find an easy way to modify
the Pig plan after the first parsing.
So PigUnit parses it one time, then modifies the Pig script and then
reparses it while adding some modifications like:
- modify an alias (e.g. change A = LOAD 'txt' --> A = LOAD 'anotherdata')
- guess a schema
- remind what it the last alias...
I could post a patch where PigUnit drops all the mv (or any other shell
command) by default if you want? Maybe the plan is easier to modify now too.
Romain
On Wed, Jun 29, 2011 at 12:09 PM, Jennie Cochran-Chinn wrote:
I was wondering why assertOutput in PigTest calls registerScript
twice? Once in assertOutput and then again in getAlias? I added a mv
to the end of my pig script and its getting called each time
registerScript is called and thus failing the second time bc the
source directory is no longer there.
Thanks,
Jennie