Getting Nutch running with UTF-8

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Getting Nutch running with UTF-8

Enzo Michelangeli-2
At http://wiki.apache.org/nutch/GettingNutchRunningWithUtf8 it is suggested,
in order to handle UTF-8 characters in GET parameters, to change the
configuration of the application server. Why can't the webapp just switch
the "request" object to UTF-8 encoding, e.g. by placing in the <head>
section of the query form page:

<jsp:scriptlet> request.setCharacterEncoding("UTF-8"); </jsp:scriptlet>

I'm doing this with a Lucene-based custom web application running on Tomcat
5.5, and it appears to work fine.

Enzo