Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-29 Thread Ivan Provalov
The test was not using the payload offsets causing this behavior.  I updated the test and added a couple more interesting use cases.  Closed the JIRA.   On Friday, February 28, 2020, 09:36:20 AM PST, Ivan Provalov wrote: Erick, I opened a JIRA for this:  https://issues.apache.org

Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-28 Thread Ivan Provalov
Erick, I opened a JIRA for this:  https://issues.apache.org/jira/browse/SOLR-14293 Thanks! Ivan - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-28 Thread Ivan Provalov
On Friday, February 28, 2020, 09:02:12 AM PST, Ivan Provalov wrote: Thanks, Erick!  Understood. On Friday, February 28, 2020, 08:24:45 AM PST, Erick Erickson wrote: Ivan: Long time! If the test fails against 8x or master, certainly raise a JIRA. But there’s little

Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-28 Thread Ivan Provalov
AM, Ivan Provalov > wrote: > > I noticed a weird payload behavior with Solr 6.3.0.  After writing the > Lucene62Codec specific unit test (attached) I think there could be a bug > which allows for the same term payloads to be written into another document's > same t

Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-28 Thread Ivan Provalov
03:09:15 AM PST, Alan Woodward wrote: Your TokenStreamComponents object is getting re-used, so only the first PayloadData object gets referenced by the PayloadFilter. > On 28 Feb 2020, at 06:55, Ivan Provalov wrote: > > I tested these versions and I can reproduce for each one

Re: Lucene62Codec Overwrites Payloads Across the Documents

2020-02-27 Thread Ivan Provalov
, February 27, 2020, 09:45:32 PM PST, Ivan Provalov wrote: I noticed a weird payload behavior with Solr 6.3.0.  After writing the Lucene62Codec specific unit test (attached) I think there could be a bug which allows for the same term payloads to be written into another document's same

Lucene62Codec Overwrites Payloads Across the Documents

2020-02-27 Thread Ivan Provalov
I noticed a weird payload behavior with Solr 6.3.0.  After writing the Lucene62Codec specific unit test (attached) I think there could be a bug which allows for the same term payloads to be written into another document's same term payload (or the second payload for the second document being ski

[jira] [Commented] (LUCENE-7321) Character Mapping

2018-08-27 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16594154#comment-16594154 ] Ivan Provalov commented on LUCENE-7321: --- [~arafalov], the clean use case is

[jira] [Commented] (LUCENE-7321) Character Mapping

2018-08-27 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16593976#comment-16593976 ] Ivan Provalov commented on LUCENE-7321: --- [~erickerickson],  Good questions:

[jira] [Commented] (LUCENE-7321) Character Mapping

2018-08-27 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16593779#comment-16593779 ] Ivan Provalov commented on LUCENE-7321: --- [~erike4...@yahoo.com], any progres

[jira] [Commented] (LUCENE-8131) Kuromoji User Dictionary Resources Not Closed

2018-01-15 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16326653#comment-16326653 ] Ivan Provalov commented on LUCENE-8131: --- Thanks, [~hossman]! > Kuromo

[jira] [Created] (LUCENE-8131) Kuromoji User Dictionary Resources Not Closed

2018-01-15 Thread Ivan Provalov (JIRA)
Ivan Provalov created LUCENE-8131: - Summary: Kuromoji User Dictionary Resources Not Closed Key: LUCENE-8131 URL: https://issues.apache.org/jira/browse/LUCENE-8131 Project: Lucene - Core

[jira] [Updated] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-9942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated SOLR-9942: Attachment: (was: solr_mlt_test.tar) > MoreLikeThis Performance Degraded With Filtered Qu

[jira] [Updated] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-9942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated SOLR-9942: Attachment: solr_mlt_test2.tar test case > MoreLikeThis Performance Degraded With Filtered Qu

[jira] [Updated] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-9942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated SOLR-9942: Description: Without any filters, the MLT is performing normal. With any added filters, the

[jira] [Updated] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-9942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated SOLR-9942: Description: Without any filters, the MLT is performing normal. With any added filters, the

[jira] [Updated] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-9942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated SOLR-9942: Attachment: solr_mlt_test.tar test for mlt performance issue > MoreLikeThis Performance Degra

[jira] [Created] (SOLR-9942) MoreLikeThis Performance Degraded With Filtered Query

2017-01-07 Thread Ivan Provalov (JIRA)
Ivan Provalov created SOLR-9942: --- Summary: MoreLikeThis Performance Degraded With Filtered Query Key: SOLR-9942 URL: https://issues.apache.org/jira/browse/SOLR-9942 Project: Solr Issue Type

[jira] [Created] (SOLR-9730) IndexSchema Dynamic Field Definition Caching

2016-11-04 Thread Ivan Provalov (JIRA)
Ivan Provalov created SOLR-9730: --- Summary: IndexSchema Dynamic Field Definition Caching Key: SOLR-9730 URL: https://issues.apache.org/jira/browse/SOLR-9730 Project: Solr Issue Type

[jira] [Commented] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-11 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15566543#comment-15566543 ] Ivan Provalov commented on LUCENE-7486: --- +1 > DisjunctionMaxScorer Init

[jira] [Commented] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-10 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15563360#comment-15563360 ] Ivan Provalov commented on LUCENE-7486: --- Thanks, Uwe! > DisjunctionMa

[jira] [Issue Comment Deleted] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-10 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-7486: -- Comment: was deleted (was: Thanks, Uwe!) > DisjunctionMaxScorer Initializes scoreMax to Z

[jira] [Commented] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-10 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15563361#comment-15563361 ] Ivan Provalov commented on LUCENE-7486: --- Thanks, Uwe! > DisjunctionMa

[jira] [Commented] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-10 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15562771#comment-15562771 ] Ivan Provalov commented on LUCENE-7486: --- Good point, Uwe. Is there a reaso

[jira] [Created] (LUCENE-7486) DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores

2016-10-10 Thread Ivan Provalov (JIRA)
Ivan Provalov created LUCENE-7486: - Summary: DisjunctionMaxScorer Initializes scoreMax to Zero Preventing From Using Negative Scores Key: LUCENE-7486 URL: https://issues.apache.org/jira/browse/LUCENE-7486

[jira] [Commented] (LUCENE-7321) Character Mapping

2016-06-08 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15321865#comment-15321865 ] Ivan Provalov commented on LUCENE-7321: --- Koji, this one works on a token l

[jira] [Updated] (LUCENE-7321) Character Mapping

2016-06-08 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-7321: -- Description: One of the challenges in search is recall of an item with a common typing

[jira] [Updated] (LUCENE-7321) Character Mapping

2016-06-08 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-7321: -- Attachment: CharacterMappingComponent.pdf Detail component description. > Character Mapp

[jira] [Updated] (LUCENE-7321) Character Mapping

2016-06-08 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-7321: -- Attachment: LUCENE-7321.patch Initial patch. > Character Mapp

[jira] [Created] (LUCENE-7321) Character Mapping

2016-06-07 Thread Ivan Provalov (JIRA)
Ivan Provalov created LUCENE-7321: - Summary: Character Mapping Key: LUCENE-7321 URL: https://issues.apache.org/jira/browse/LUCENE-7321 Project: Lucene - Core Issue Type: New Feature

Re: LUCENE-3687: Allow similarity to encode norms other than a single byte

2015-10-05 Thread Ivan Provalov
, 2015 6:56 AM, Ivan Provalov wrote: Mikhail, Thank you for your reply. Even though the long is returned from this function, it is always encoded as a single byte lossy representation.  In order to change that and add other norms (for using other similarity functions on the same indexed

Re: LUCENE-3687: Allow similarity to encode norms other than a single byte

2015-10-04 Thread Ivan Provalov
ex. */ public abstract long encodeNormValue(float f); On Sun, Oct 4, 2015 at 6:39 AM, Ivan Provalov wrote: When does this 4.0-ALPHA feature going to be included in the released version? >https://issues.apache.org/jira/browse/LUCENE-3687 >It's the "Allow similarity to encode

LUCENE-3687: Allow similarity to encode norms other than a single byte

2015-10-03 Thread Ivan Provalov
's only on 4.0-alpha.  I am using 4.6.1, but also looked in 5.3.1 source, none of these include the changes.   With these changes, the new API in the sim class is accepting the norms, like so: computeNorm(FieldInvertState state, Norm norm).  Thank you, Ivan Provalov

[jira] [Commented] (SOLR-3931) Turn off coord() factor for scoring

2015-09-30 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14938785#comment-14938785 ] Ivan Provalov commented on SOLR-3931: - Ideally, I would like to plug in a simila

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

2010-08-22 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-474: - Attachment: (was: collocations.zip) > High Frequency Terms/Phrases at the Index le

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

2010-08-22 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-474: - Attachment: collocations.zip > High Frequency Terms/Phrases at the Index le

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

2010-08-22 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-474: - Attachment: (was: collocations.zip) > High Frequency Terms/Phrases at the Index le

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

2010-08-22 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-474: - Attachment: collocations.zip Included the scoring in the CollocationsSearcher which now will

[jira] Updated: (LUCENE-474) High Frequency Terms/Phrases at the Index level

2010-08-21 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Provalov updated LUCENE-474: - Attachment: collocations.zip I saw some activity on the term collocations in the lucene user

[jira] Commented: (LUCENE-2458) queryparser shouldn't generate phrasequeries based on term count

2010-05-12 Thread Ivan Provalov (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1285#action_1285 ] Ivan Provalov commented on LUCENE-2458: --- Robert has asked me to post our