Our search returns a single result, because it is the only page on the
site that contains the provided search terms. This works just as
expected, but the summary doesn't contain anything to do with the search
query, even though the words in the query are right next to each other
in the body of the returned page.
Is this because Nutch only stores the first x bytes of a page for use in
summaries? Or is it something completely different?
The page is quite large at 35297 bytes, and the relevant search terms
are found near the end.
Thanks in advance.