[jira] Created: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

classic Classic list List threaded Threaded
14 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
[PATCH] Efficiently retrieve sizes of field values
--------------------------------------------------

                 Key: LUCENE-762
                 URL: http://issues.apache.org/jira/browse/LUCENE-762
             Project: Lucene - Java
          Issue Type: New Feature
          Components: Store
    Affects Versions: 2.1
            Reporter: Chuck Williams


Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.

This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.


--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
     [ http://issues.apache.org/jira/browse/LUCENE-762?page=all ]

Chuck Williams updated LUCENE-762:
----------------------------------

    Attachment: SizeFieldSelector.patch

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: http://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org
    [ http://issues.apache.org/jira/browse/LUCENE-762?page=comments#action_12461382 ]
           
Grant Ingersoll commented on LUCENE-762:
----------------------------------------

Hi Chuck,

I haven't applied this yet, but it looks good.  My only hesitation, and this is no reflection on how you have implemented it, but the if-then-else structure around line 104 is starting to get ugly.  I can imagine it growing and growing as more cases are handled.  I was wondering if maybe we should convert to a map lookup approach.  In Java 1.5 the map would be something like:
Map<FieldSelectorResult, FieldSelectorFunctor>

where the Functor does the work of what is in the clause of each of the different cases
then, the if-else structure could be replaced by
FieldSelectorFunctor functor = map.get(FieldSelectorResult);
if (functor != null)
{
    functor.apply(doc, fi, binary, compressed, tokenize);
}
else
{
    skipField(binary, compressed)
}


The constructor/static would be responsible for instantiating the Map.  The Functors could be implemented as inner classes (although this can lead to a bunch of inner classes too)

What do you think?

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: http://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Assigned: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org
     [ http://issues.apache.org/jira/browse/LUCENE-762?page=all ]

Grant Ingersoll reassigned LUCENE-762:
--------------------------------------

    Assignee: Grant Ingersoll

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: http://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Updated: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org
     [ http://issues.apache.org/jira/browse/LUCENE-762?page=all ]

Grant Ingersoll updated LUCENE-762:
-----------------------------------

    Lucene Fields:   (was: [New, Patch Available])
         Priority: Minor  (was: Major)

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: http://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org
    [ http://issues.apache.org/jira/browse/LUCENE-762?page=comments#action_12461460 ]
           
Chuck Williams commented on LUCENE-762:
---------------------------------------

Hi Grant,

Maybe even better would be to have an appropriate method on FieldSelectorResult.  E.g.:

FieldSelectorResult.readField(doc, fieldsStream, fi, binary, compressed, tokenized)

This would eliminate the tests or map lookup in performance-critical code.






> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: http://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12466885 ]

Grant Ingersoll commented on LUCENE-762:
----------------------------------------

This would also eliminate the need to change FieldsReader for every new FieldSelectorResult that we want.  The downside is that it would most likely involve some reworking of FieldSelectorResult.  This probably isn't a big deal, since the FieldSelector stuff hasn't been released yet and has been noted to be experimental, but it has been on trunk for a while, so it is in use, I'm sure.  I think Solr has incorporated it, but I am not sure.

Patch that incorporates your existing patch and this new approach to follow soon.

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: https://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12466887 ]

Grant Ingersoll commented on LUCENE-762:
----------------------------------------

Hmm, I forgot to mention that FieldInfo is package local.  This complicates things a little bit, but I could expand the proposed method to pass in the appropriate FieldInfo properties, which seems to be name and omitNorms, but that doesn't seem quite right b/c we may want other FieldInfo later.  I suppose I could copy the info into a holding structure, but that seems like a waste.  Another option is to make FieldInfo public and label it as Expert Use/Internal Use only.  It exposes some of the implementation details, but not all and wouldn't be meaningful to most people, I guess.

Any strong opinions on making FieldInfo public?

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: https://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: [jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

Mike Klaas
In reply to this post by JIRA jira@apache.org
On 1/23/07, Grant Ingersoll (JIRA) <[hidden email]> wrote:
>
>     [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12466885 ]
>
> Grant Ingersoll commented on LUCENE-762:
> ----------------------------------------
>
> This would also eliminate the need to change FieldsReader for every new FieldSelectorResult that we want.  The downside is that it would most likely involve some reworking of FieldSelectorResult.  This probably isn't a big deal, since the FieldSelector stuff hasn't been released yet and has been noted to be experimental, but it has been on trunk for a while, so it is in use, I'm sure.  I think Solr has incorporated it, but I am not sure.

Yes, we have incorporated it and it is in release 1.1.  However,  it
is a backend optimization that isn't exposed to the user (except
perhaps some doing fancy things with custom RequestHandlers), so it
isn't too much trouble to modify.

-Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12466904 ]

Chuck Williams commented on LUCENE-762:
---------------------------------------

I use FieldInfo heavily and many other package-level API's, but put my classes into Lucene packages to do this.  To bad Java doesn't have intermediate access levels between public and package, e.g., open to this package and it's subpackages.

Maybe move FIeldSelector and FieldSelectorResult out of org.apache.lucene.document and into org.apache.lucene.index as they are deeply integrated with FieldsReader?  They don't have any package-level API's at present.


> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: https://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: [jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

Grant Ingersoll-2
In reply to this post by Mike Klaas
Mike,

Do you have any preference on making FieldInfo public versus moving  
the FieldSelector stuff into the index package?

-Grant

On Jan 23, 2007, at 10:23 PM, Mike Klaas wrote:

> On 1/23/07, Grant Ingersoll (JIRA) <[hidden email]> wrote:
>>
>>     [ https://issues.apache.org/jira/browse/LUCENE-762?
>> page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> tabpanel#action_12466885 ]
>>
>> Grant Ingersoll commented on LUCENE-762:
>> ----------------------------------------
>>
>> This would also eliminate the need to change FieldsReader for  
>> every new FieldSelectorResult that we want.  The downside is that  
>> it would most likely involve some reworking of  
>> FieldSelectorResult.  This probably isn't a big deal, since the  
>> FieldSelector stuff hasn't been released yet and has been noted to  
>> be experimental, but it has been on trunk for a while, so it is in  
>> use, I'm sure.  I think Solr has incorporated it, but I am not sure.
>
> Yes, we have incorporated it and it is in release 1.1.  However,  it
> is a backend optimization that isn't exposed to the user (except
> perhaps some doing fancy things with custom RequestHandlers), so it
> isn't too much trouble to modify.
>
> -Mike
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>

--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org

Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/ 
LuceneFAQ



---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12469326 ]

Grant Ingersoll commented on LUCENE-762:
----------------------------------------

I agree, Chuck, I sometimes wish you could have JAR level access or something like that.  Off topic, but doesn't C++ have a friend operator (not that I'm wishing for the C++ days) that lets you do these kind of things.

I originally put FieldSelector in the Document package b/c I felt it was more closely associated with Documents and Fields than the Index package.  Now, I guess, since the FieldsReader is in Index, I can see a strong argument for that location.

Anyway, I thought of an alternative that should keep everyone happy:
Create a new Interface named FieldInfoAccessor which provides get methods for the properties on FieldInfo, which will implement the interface.  Pass that to FieldSelectorResult.readField method instead of FieldInfo and proceed as stated above.  

The only sticking point I am now having is what to do with the Lazy Fields, since they require the ThreadLocal in FieldsReader.  I guess I would have to pass in the cloneable Fields Reader and the threadlocal, but that doesn't seem quite right either.  Any thoughts?  Moving the FieldSelector stuff into index package wouldn't address this, I don't think.  Maybe a brilliant thought will occur to me in the morning.

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: https://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: [jira] Commented: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

Mike Klaas
In reply to this post by Grant Ingersoll-2
On 1/25/07, Grant Ingersoll <[hidden email]> wrote:
> Mike,
>
> Do you have any preference on making FieldInfo public versus moving
> the FieldSelector stuff into the index package?

Not at all.  Our use is pretty basic as will be easy to modify to
conform to class movement/renaming.

-Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (LUCENE-762) [PATCH] Efficiently retrieve sizes of field values

JIRA jira@apache.org
In reply to this post by JIRA jira@apache.org

     [ https://issues.apache.org/jira/browse/LUCENE-762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved LUCENE-762.
------------------------------------

    Resolution: Fixed

I have committed the original patch.  All tests pass.  In the end, I could not find a way I felt comfortable with for getting rid of the if-then-else clause in FieldsReader.  I did add a TODO item there to remind us to go back and take a look at it again later.

Since the if clauses are ordered according to their most common usages (I think), I don't think there will be much of a performance issue w/ the current approach.

> [PATCH] Efficiently retrieve sizes of field values
> --------------------------------------------------
>
>                 Key: LUCENE-762
>                 URL: https://issues.apache.org/jira/browse/LUCENE-762
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.1
>            Reporter: Chuck Williams
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>         Attachments: SizeFieldSelector.patch
>
>
> Sometimes an application would like to know how large a document is before retrieving it.  This can be important for memory management or choosing between algorithms, especially in cases where documents might be very large.
> This patch extends the existing FieldSelector mechanism with two new FieldSelectorResults:  SIZE and SIZE_AND_BREAK.  SIZE creates fields on the retrieved document that store field sizes instead of actual values.  SIZE_AND_BREAK is especially efficient if one field comprises the bulk of the document size (e.g., the body field) and can thus be used as a reasonable size approximation.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]