[jira] Created: (NUTCH-617) Cached Text Only


Hudson (Jira)
Cached Text Only
----------------

                 Key: NUTCH-617
                 URL: https://issues.apache.org/jira/browse/NUTCH-617
             Project: Nutch
          Issue Type: New Feature
            Reporter: Siddharth Jha
            Priority: Critical


Hello All

I would like to know whether Nutch can keep a text-only cached copy of web pages. By "Cached Text" I mean that only the text part of a web page is stored, without any images.

Thanks
Siddharth

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Closed: (NUTCH-617) Cached Text Only

Hudson (Jira)

     [ https://issues.apache.org/jira/browse/NUTCH-617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki closed NUTCH-617.
----------------------------------

    Resolution: Invalid

This type of question belongs on the nutch-user mailing list; please ask there.

