Analyze

R.Mayoran
Dear Doug,

I have a question on "Analyze".

When I ran "bin/nutch analyze <db_dir> 1" on a segment of 5 million
pages, it took around 1 hour. I want to run it 100 times. Is there any
way to speed up this process?

Could you please suggest an appropriate number of iterations for improving
the precision of the overall calculation? Is 100 enough?

Thanking you in anticipation.

Mayoran.