Hello,
Nutch is stalling in the fetch process. I've run it twice now, and it is
stopping on the *same* URL both times. I don't get what's going on!
The last status report was:
060810 145315 status: segment 20060810142649, 7900 pages, 14 errors,
98421231 bytes, 1571224 ms
060810 145315 status: 5.0279274 pages/s, 489.3738 kb/s, 12458.384 bytes/page
Then, exactly 94 documents later with no errors in between, it just stops.
On what appears to be a perfectly normal URL and a perfectly normal page. I
don't get it.
How can I debug this situation further, to see what's going on?
I'm really frustrated since I don't know where to start looking.
Nutch is still running, taking up a lot of CPU. I don't want to kill it
unless it really stuck. How can I tell?
Ben