Solr indexing HTML metatags from Nutch

ML mail
Hello,

I am using Nutch 1.4 with Solr 3.6.0 and would like to get the HTML keywords and description metatags indexed into Solr. On the Nutch side I have followed http://wiki.apache.org/nutch/IndexMetatags to get Nutch parsing and extracting the metatags (using the index-metatags and parse-metatags plugins), but when I run solrindex the metatags do not show up in the Solr index.

In Solr I am using the schema.xml provided by Nutch and have added the following fields for the metatags:
 
        <!-- fields for the metatags plugin -->
        <field name="metatag.description" type="text" stored="true" indexed="true"/>
        <field name="metatag.keywords" type="text" stored="true" indexed="true"/>

and have created a solrindex-mapping.xml file as follows:

<mapping>
        <fields>
                <field dest="description" source="metatag.description"/>
                <field dest="keywords" source="metatag.keywords"/>
        </fields>
</mapping>

The rest is pretty much a default install of Solr. So my question is: why can't I see the metatags indexed in Solr? Did I perhaps forget to configure something in Solr?
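One thing that can be checked mechanically is whether every dest name in the mapping matches a field declared in the schema. Below is a minimal sketch of such a check in Python, using only the two fragments quoted above. The function name unmapped_destinations and the wrapping <fields> element around the schema fragment are assumptions made for illustration. The full schema.xml shipped with Nutch may declare further fields or dynamic fields that this fragment-only check does not see.

```python
import xml.etree.ElementTree as ET

# Schema fragment from this post, wrapped in a <fields> root so it parses
# as a standalone document (the wrapper is an assumption for this sketch).
SCHEMA_FIELDS_XML = """
<fields>
  <field name="metatag.description" type="text" stored="true" indexed="true"/>
  <field name="metatag.keywords" type="text" stored="true" indexed="true"/>
</fields>
"""

# solrindex-mapping.xml as quoted in this post.
MAPPING_XML = """
<mapping>
  <fields>
    <field dest="description" source="metatag.description"/>
    <field dest="keywords" source="metatag.keywords"/>
  </fields>
</mapping>
"""


def unmapped_destinations(schema_xml, mapping_xml):
    """Return mapping dest names that have no same-named schema field.

    Hypothetical helper: it only compares literal field names and does not
    account for dynamicField patterns or fields declared elsewhere.
    """
    schema_names = {f.get("name") for f in ET.fromstring(schema_xml).iter("field")}
    dests = [f.get("dest") for f in ET.fromstring(mapping_xml).iter("field")]
    return [d for d in dests if d not in schema_names]


missing = unmapped_destinations(SCHEMA_FIELDS_XML, MAPPING_XML)
print(missing)  # ['description', 'keywords']
```

For the fragments shown here, the check reports that description and keywords are mapping targets without a same-named field among the two I added, so a name mismatch between the mapping and the schema may be worth ruling out.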

Any suggestions are welcome.

Thanks
M.L.