[jira] Created: (LUCENE-1211) Small speedups to DocumentsWriter's quickSort

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] Created: (LUCENE-1211) Small speedups to DocumentsWriter's quickSort

Tim Allison (Jira)
Small speedups to DocumentsWriter's quickSort
---------------------------------------------

                 Key: LUCENE-1211
                 URL: https://issues.apache.org/jira/browse/LUCENE-1211
             Project: Lucene - Java
          Issue Type: Bug
          Components: Index
    Affects Versions: 2.3.1, 2.3
            Reporter: Michael McCandless
            Assignee: Michael McCandless
            Priority: Minor
             Fix For: 2.4


In working on LUCENE-510 I found that DocumentsWriter's quickSort can
be further optimized to handle the common case of sorting only 2
values.

I ran with this alg:

  analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer
 
  doc.maker=org.apache.lucene.benchmark.byTask.feeds.LineDocMaker
 
  docs.file=/Volumes/External/lucene/wiki.txt
  doc.stored = true
  doc.term.vector = true
  doc.add.log.step=2000
  doc.maker.forever = false
 
  directory=FSDirectory
  autocommit=false
  compound=false
 
  ram.flush.mb=64
 
  { "Rounds"
    ResetSystemErase
    { "BuildIndex"
      CreateIndex
      { "AddDocs" AddDoc > : 200000
      - CloseIndex
    }
    NewRound
  } : 5
 
  RepSumByPrefRound BuildIndex

Best of 5 was 857.3 docs/sec before the optimization and 881.6 after =
2.8% speedup, on a quad-core Mac Pro with 4-drive RAID 0 array.

The fix is trivial.  I will commit shortly.



--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

[jira] Resolved: (LUCENE-1211) Small speedups to DocumentsWriter's quickSort

Tim Allison (Jira)

     [ https://issues.apache.org/jira/browse/LUCENE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless resolved LUCENE-1211.
----------------------------------------

    Resolution: Fixed

> Small speedups to DocumentsWriter's quickSort
> ---------------------------------------------
>
>                 Key: LUCENE-1211
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1211
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Index
>    Affects Versions: 2.3, 2.3.1
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>            Priority: Minor
>             Fix For: 2.4
>
>
> In working on LUCENE-510 I found that DocumentsWriter's quickSort can
> be further optimized to handle the common case of sorting only 2
> values.
> I ran with this alg:
>   analyzer=org.apache.lucene.analysis.standard.StandardAnalyzer
>  
>   doc.maker=org.apache.lucene.benchmark.byTask.feeds.LineDocMaker
>  
>   docs.file=/Volumes/External/lucene/wiki.txt
>   doc.stored = true
>   doc.term.vector = true
>   doc.add.log.step=2000
>   doc.maker.forever = false
>  
>   directory=FSDirectory
>   autocommit=false
>   compound=false
>  
>   ram.flush.mb=64
>  
>   { "Rounds"
>     ResetSystemErase
>     { "BuildIndex"
>       CreateIndex
>       { "AddDocs" AddDoc > : 200000
>       - CloseIndex
>     }
>     NewRound
>   } : 5
>  
>   RepSumByPrefRound BuildIndex
> Best of 5 was 857.3 docs/sec before the optimization and 881.6 after =
> 2.8% speedup, on a quad-core Mac Pro with 4-drive RAID 0 array.
> The fix is trivial.  I will commit shortly.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]