Retrieving text content from html files

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Retrieving text content from html files

Bugzilla from pau243@gmail.com
Hello,
I am developing a Nutch plugin that needs to read the text content from some URL's. I think that parse-html plugin contains the necessary code to do so, but I don't know what methods to use and how to use them. What should I do?
Thanks.