Parsing/indexing Open Graph meta tags from HTML

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Parsing/indexing Open Graph meta tags from HTML

mabi
Hi,

I am currently testing Nutch 2.3.1 and need to be able to parse and index Open Graph meta tags in HTML such as this one:

<meta property="og:title" content="The Rock" />

Unfortunately the parse-metatags and index-metadata only works by extracting meta tags with their name attribute and not property.

Does anyone have a workaround how I can still use Nutch to parse and index Open Graph meta tags from HTML?

Thanks for your help.

Best regards,
Mabi