protocol redirect for nutch 0.7.2

Sunnyvale Fl
Hi,
Is there an easy way to change how Nutch 0.7.2 handles protocol redirects?
Currently, I believe that if a site www.foo.com redirects (HTTP 30x) to
www.bar.com, Nutch indexes the content under foo instead of bar. I read that
this is fixed in version 0.8. Is there a fix for 0.7.2, or can someone
suggest one? Thanks!
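
To make the behavior I'm after concrete, here is a rough sketch in plain Java. It is not Nutch code and does not use any Nutch API: the class name RedirectResolver, the resolve method, and the map of redirects are all hypothetical and made up for illustration. The idea is just that the fetcher would follow the chain of Location targets and store the content under the last URL in the chain, not the first.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, NOT part of Nutch: resolves a redirect chain so that
// content can be keyed by the final URL instead of the originally requested one.
class RedirectResolver {

    /**
     * Follows entries in redirects (source URL -> redirect target) starting
     * at url and returns the last URL reached. Stops after maxRedirects hops,
     * when no further redirect exists, or when a loop is detected.
     */
    static String resolve(String url, Map<String, String> redirects, int maxRedirects) {
        Set<String> seen = new HashSet<>();
        String current = url;
        for (int hops = 0; hops < maxRedirects; hops++) {
            String next = redirects.get(current);
            if (next == null || !seen.add(current)) {
                break; // no further redirect, or this URL was already visited
            }
            current = next;
        }
        return current;
    }

    public static void main(String[] args) {
        // Assumed example data: foo answers with a 30x pointing at bar.
        Map<String, String> redirects = new HashMap<>();
        redirects.put("http://www.foo.com/", "http://www.bar.com/");

        // The page content would then be indexed under this URL.
        System.out.println(resolve("http://www.foo.com/", redirects, 5));
    }
}
```

In a real fetcher the map lookup would be replaced by reading the Location header of each 30x response; the hop limit and the visited set are there only to keep redirect loops from running forever.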