[
https://issues.apache.org/jira/browse/LUCENE-2662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Simon Willnauer updated LUCENE-2662:
------------------------------------
Attachment: LUCENE-2662.patch
Next iteration - seems to be very close!
I have applied the following changes:
* introduces a AtomicLong to track bytesUsed in DocumetnsWriter,
TermsHashPerField, ByteRefHash and RecyclingByteBlockAllocator
* Factored out a BytesStartArray class from BytesRefHash that manages the
int[] holding the bytesStart offsets. TermsHashPerField subclasses and manages
the ParallelPostingsArray through it.
* remove remaining no-commits
* made RecyclingbyteBlockAllocator synced by default (we use synchronized
methods for it now)
I run a quick Wikipedia 100k docs benchmark against trunk vs. LUCENE-2662 and
the results are promising.
|version|rec/sec|elapsed sec|avgUsedMem|
|LUCENE-2662|717.30|139.41|536,682,592|
|trunk| 682.66|146.49|546,065,344|
I will run the 10M benchmark once I get back to this.
> BytesHash
> ---------
>
> Key: LUCENE-2662
> URL: https://issues.apache.org/jira/browse/LUCENE-2662
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Affects Versions: Realtime Branch, 4.0
> Reporter: Jason Rutherglen
> Assignee: Simon Willnauer
> Priority: Minor
> Fix For: Realtime Branch, 4.0
>
> Attachments: LUCENE-2662.patch, LUCENE-2662.patch, LUCENE-2662.patch,
> LUCENE-2662.patch, LUCENE-2662.patch
>
>
> This issue will have the BytesHash separated out from LUCENE-2186
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]