nutch search log and analysis tool?

AJ Chen-2
To promote open data on the semantic web, I'm building a search engine
focused on the R&D space. This community search site, web2express.org, is
powered by Nutch; big thanks to the Nutch development community. It can
search HTML documents as well as semantic documents created with the SPE and
BOON ontologies <http://esw.w3.org/topic/HCLS/ScientificPublishingTaskForce>.
This effort supports the Scientific Publishing Task Force under the W3C. The
hope is that, through an open development process, the R&D community will
gradually work out a better search engine that takes advantage of the
emerging semantic web layer.

The Web2x search engine currently has over 2 million web documents from the
R&D space, but almost no semantic documents yet, since the relevant
ontologies and semantic publishing tools <http://www.web2express.org/openlab/>
have only just appeared. Hopefully, when people see that their data can be
searched and shared more effectively on community search engines, they will
publish more and more raw data in semantic format.

Questions: How do I configure Nutch to write log messages to files during
search? Can anyone recommend a log analysis tool for understanding how the
search engine is being used?
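
To make the second question concrete, here is a rough sketch of the kind of
usage summary I have in mind. It assumes, purely for illustration, that each
search ends up as one log line of the form "<date> <time> <level> query=<terms>".
That line format, the sample lines, and the names LINE_RE and summarize are
hypothetical, not anything Nutch is known to produce.

```python
import re
from collections import Counter

# Hypothetical log line format (an assumption, not Nutch's actual output):
#   2007-03-01 12:00:05 INFO query=semantic web
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2}) \S+ \S+ query=(.*)$")


def summarize(lines):
    """Count searches per day and per normalized query string."""
    per_day = Counter()
    per_query = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip lines that are not search entries
        day, query = match.groups()
        per_day[day] += 1
        per_query[query.strip().lower()] += 1
    return per_day, per_query


# Made-up sample lines, only to show the shape of the input.
sample = [
    "2007-03-01 12:00:05 INFO query=semantic web",
    "2007-03-01 12:03:41 INFO query=Semantic Web",
    "2007-03-02 09:15:00 INFO query=protein ontology",
    "2007-03-02 09:15:02 WARN slow response",
]

per_day, per_query = summarize(sample)
for query, count in per_query.most_common(10):
    print(count, query)
```

Something that reports searches per day and the most frequent queries would
already tell me a lot about how the site is used.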

Thanks,
AJ
--
AJ Chen, PhD
Palo Alto, CA
http://web2express.org
"Open data on semantic web"