Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
Hi,- Is there a newer version of this great article from Mr. Grant Ingersoll? https://lucidworks.com/2009/08/05/getting-started-with-payloads/ Thanks This article is based on Lucene 2.9. Best regards - To unsubscribe, e-mail:

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
Thanks i saw these posts but Grant's article is based on Lucene. i am not using Solr. Many classes in that article do not exist in latest versions of Lucene like version 6.1. For instance BoostingTermQuery does not exist in 6.1 and the way docs are indexed is also different on 6.1. There

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
Sure, can You please point me to the location under Lucene Solr? In Grant's article: i want to know the need to use BoostingTermQuery (now in latest version PayloadScoreQuery) where we already specify payloads in the data in the form |. Best regards On 7/5/18 11:41 AM, Erick Erickson wr

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
i mean i know the function of BoostingTermQuery class: The BoostingTermQuery is very similar to the SpanTermQuery except that it factors in the value of the payload located at each of the positions where the Term occurs. In order to take advantage of this, you must override Similarity.scoreP

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
org.apache.lucene.index.IndexWriter class does not have setSimilarity method, am i missing something for this? i checked multiple Lucene versions. next, i have this problem: After defining the Analyzer as the PayloadAnalyzer like on the blog mentioned before, i declared org.apache.lucene.s

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
One thing i noticed is that org.apache.lucene.index.IndexWriter class does not have setSimilarity and it is moved to org.apache.lucene.index.IndexWriterConfig class. thus, i resolved the first question below. Best regards On 7/5/18 3:17 PM, baris.ka...@oracle.com wrote: org.apache.lucene.in

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-05 Thread baris . kazar
Another piece resolved: DefaultSimilarity is now ClassicSimilarity since LUCENE VERSION 5.4. Best regards On 7/5/18 4:30 PM, baris.ka...@oracle.com wrote: One thing i noticed is that org.apache.lucene.index.IndexWriter class does not have setSimilarity and it is moved to org.apache.lucene.i

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-06 Thread baris . kazar
can someone point to any usage of PayloadScoreQuery class from Lucene please? Best regards On 7/5/18 4:49 PM, baris.ka...@oracle.com wrote: Another piece resolved: DefaultSimilarity is now ClassicSimilarity since LUCENE VERSION 5.4. Best regards On 7/5/18 4:30 PM, baris.ka...@oracle.com w

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-08 Thread Baris Kazar
it seems with PayloadScoreQuery, QueryParser is also supported right for searching/indexing ***multiple*** terms. However, there are no examples in the Javadocs. @Grant Can You please publish new version of Your great blog article? Thanks very much - Original Message - From: baris.ka

Re: Grant Ingersoll's 2009 blog article- is there a newer version?

2018-07-14 Thread baris . kazar
the first field first, then if not successful, do partial matching on both fields. i also read https://lucene.apache.org/core/6_0_1/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html how can i achieve this in lucene? Best regards On 7/8/18 4:08 PM, Baris Kazar wrote: it seems

Lucene Scoring

2018-07-15 Thread Baris Kazar
Since i cant use payloads, i tried another way: i decomposed my text into two fields and i want the first field to be searched until nothing is found/matched there. then i want the second field to be searched. currently i added field.setBoost but that does not help. i see scores being modif

Lucene scoring components

2018-07-17 Thread baris . kazar
Hi,- is there a way to diminish the tf(t in d) component to 1? i dont want the number of times a word appears to affect the scoring for my app. Best regards
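One way to picture the request: in the classic TF-IDF formula the tf factor is the square root of the term frequency. The sketch below is plain Java with no Lucene dependency. `classicTf` mirrors that square-root default, and `flatTf` is a hypothetical replacement that returns 1 for any matching document. In Lucene 6.x the analogous hook would presumably be overriding `tf(float freq)` in a `ClassicSimilarity` subclass and setting that similarity on both `IndexWriterConfig` and `IndexSearcher`; treat that as a pointer to verify against the Javadocs, not as tested code.

```java
class TfSketch {
    // Square-root term frequency, as in the classic TF-IDF formula.
    static float classicTf(float freq) {
        return (float) Math.sqrt(freq);
    }

    // Hypothetical flattened variant: a term counts once, however often it occurs.
    static float flatTf(float freq) {
        return freq > 0 ? 1.0f : 0.0f;
    }

    public static void main(String[] args) {
        System.out.println(classicTf(9f)); // prints 3.0
        System.out.println(flatTf(9f));    // prints 1.0
        System.out.println(flatTf(1f));    // prints 1.0
    }
}
```

With the flattened variant, a document containing the word nine times and one containing it once get the same tf factor, so only the other scoring components separate them.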

Re: Lucene scoring components

2018-07-17 Thread baris . kazar
i forgot to put the doc that i was referring to: https://lucene.apache.org/core/6_0_1/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html Best regards On 7/17/18 1:01 PM, baris.ka...@oracle.com wrote: Hi,- is there a way to diminish the tf(t in d) component to 1? i dont want th

Lucene scoring overall score

2018-07-17 Thread baris . kazar
Hi,- how can i check the contributions from different fields indexed in the hits doc's score? Best regards

Lucene BooleanQuery with some TermQuery's having BooleanClause.Occur set MUST for all

2018-07-18 Thread baris . kazar
Hi,- i have an indexed field having "$word1 word2" and i want to find the docs having these two words first in my first query. i have another indexed field but i am not searching on that second field for this first query which is BooleanQuery with two TermQuery's having BooleanClause.Occu

Re: Not getting desired result through TermQuery

2018-07-18 Thread baris . kazar
My problem seems similar to this one. i make sure index has all lower cased and TermQuery search term also gets all lower cased. i tokenize the search string since index uses StandardTokenizer and StandardFilter and LowerCaseFilter and ASCIIFoldingFilter. My index uses StandardTokenizer and

does $ mean something in Lucene index and MultiFieldQueryParser

2018-07-18 Thread baris . kazar
It seems in my query string i cant see $ when i print it out from MultiFieldQueryParser but the search string has $ in it and it finds hits. On the other hand, TermQuery-based BooleanQuery keeps $ and finds no hits. i use $ for starts with effect. Best regards

"java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
Lucene 6.1.0 is ok but Lucene 6.6.0 issues an error related to codecs. i dont want to modify META-INF. https://bugzilla.redhat.com/show_bug.cgi?id=1352985 Any suggestions please? Best regards

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
Caused By: java.lang.IllegalArgumentException: An SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene60' does not exist. You need to add the corresponding JAR file supporting this SPI to your classpath. The current classpath supports the following names: [Lucene62] Any suggesti

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
Ok, resolved. i thought i included 6.6.0 in the path but 6.1.0 was still there. Thanks - Original Message - From: baris.ka...@oracle.com To: java-user@lucene.apache.org Sent: Saturday, July 28, 2018 2:45:08 PM GMT -05:00 US/Canada Eastern Subject: Re: "java.lang.IllegalArgumentException: C

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
well, fixing path did not fix this problem Caused By: java.lang.IllegalArgumentException: An SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene60' does not exist. You need to add the corresponding JAR file supporting this SPI to your classpath. The current classpath supports th

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
Ok, found the problem: Lucene60 codec classes do not exist in Lucene 6.6.0 core jar: 297:org/apache/lucene/codecs/lucene60/Lucene60FieldInfosFormat$1.class 298:org/apache/lucene/codecs/lucene60/Lucene60FieldInfosFormat.class 299:org/apache/lucene/codecs/lucene60/Lucene60PointsFormat.class 300:org

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
is there a packaging issue? i also checked the next Lucene versions and they have the same structure, i.e. no Lucene60 codec classes. Best regards - Original Message - From: baris.ka...@oracle.com To: java-user@lucene.apache.org Sent: Saturday, July 28, 2018 4:13:28 PM GMT -05:00 US/Canada Ea

Re: "java.lang.IllegalArgumentException: Could not load codec 'Lucene60'. Did you forget to add lucene-backward-codecs.jar?" Error

2018-07-28 Thread Baris Kazar
-backward-codecs.jar?" Error Just do what the error message tells you to do ... Baris Kazar wrote on Sat, 28 July 2018, 22:41: > is there a packaging issue but i also checked next Lucene versions and > they have the same structure ie no Lucene60 codec classes. > Best regards >
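A quick way to confirm whether the jar named in the error message is actually visible at runtime is to probe for one of its classes by name. The sketch below is plain Java. `isOnClasspath` is a hypothetical helper, and the class name `org.apache.lucene.codecs.lucene60.Lucene60Codec` is an assumption derived from the codec name and package path quoted in this thread.

```java
class ClasspathProbe {
    // Returns true if the named class can be located, without initializing it.
    static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, ClasspathProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed class name; expected to be found only when
        // lucene-backward-codecs.jar is on the classpath.
        String codec = "org.apache.lucene.codecs.lucene60.Lucene60Codec";
        System.out.println(codec + " visible: " + isOnClasspath(codec));
    }
}
```

Running this with the same classpath as the failing application shows whether a stale or missing jar is the cause before touching META-INF.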

any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-07-31 Thread baris . kazar
The following page says: http://lucene.apache.org/core/6_6_0/core/org/apache/lucene/document/Field.html#setBoost-float- setBoost @Deprecated public void setBoost(float boost) Deprecated. Index-time boosts are deprecated, please index index-time scoring factors into a doc value field and combine

Re: any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-07-31 Thread baris . kazar
https://stackoverflow.com/questions/50952727/ho-to-use-functionscorequery-with-text-fields Somebody else was also asking this. Best regards On 7/31/18 3:56 PM, baris.ka...@oracle.com wrote: The following page says: https://urldefense.proofpoint.com/v2/url?u=http-3A__lucene.apache.org_core_6-

Re: any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-07-31 Thread baris . kazar
Robert, thanks, this is helpful but why did this change when it was great to use? SortField is a new concept to me and i am not sure this is avail in 6.6.0 but will check. This new way seems more tricky. if there are more examples, i will be happier :) Best regards On 7/31/18 6:19 PM, Ro

Re: any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-08-01 Thread baris . kazar
i have one MultiFieldQueryParser based Query and one BooleanQuery with dynamic number of TermQuery's. i dont have SortField objects. Are there examples with the above query types please? Best regards On 7/31/18 7:59 PM, baris.ka...@oracle.com wrote: Robert, thanks, this is helpful but why

Re: any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-08-01 Thread baris . kazar
These methods look promising in terms of ease of use of FunctionScoreQuery but they dont exist in Lucene 6.6.0. static FunctionScoreQuery boostByQuery(Query in, Query boostMatch, float boostValue) Returns a FunctionScoreQuery where the scores of a wrapped query are multiplied by a boost fac

Re: any example on FunctionScoreQuery since Field.setBoost is deprecated with Lucene 6.6.0

2018-08-01 Thread baris . kazar
Someone else also asked: http://lucene.472066.n3.nabble.com/index-time-field-level-boosting-in-6-6-0-td4351797.html Best regards On 8/1/18 5:12 PM, baris.ka...@oracle.com wrote: These methods look promising in terms of ease of use of FunctionScoreQuery but they dont exist in Lucene 6.6.0. sta

How does MultiFieldQueryParser work? Any example showing its mechanism please

2018-08-24 Thread baris . kazar
Hi,- i am trying to understand what this api does and it seems it decomposes the input string by spaces and OR's them and then depending on occur field setting the whole OR'ed expression is either under "+" (i.e., must) or just OR'ed again. Such as: input string abc def occur values are : m
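The behaviour described here can be modelled as string expansion. The sketch below is plain Java and does not call Lucene. `MultiFieldModel`, its `Occur` enum and `expand` are hypothetical names, whitespace splitting stands in for the analyzer, and the output format only approximates how a parsed query prints. It assumes the per-field occur flags are applied to the whole OR'ed group for that field, as the message describes.

```java
import java.util.ArrayList;
import java.util.List;

class MultiFieldModel {
    enum Occur { MUST, SHOULD, MUST_NOT }

    // Builds one OR'ed group of terms per field, prefixed by that field's occur flag.
    static String expand(String input, String[] fields, Occur[] flags) {
        List<String> clauses = new ArrayList<>();
        for (int i = 0; i < fields.length; i++) {
            List<String> terms = new ArrayList<>();
            for (String token : input.trim().split("\\s+")) {
                terms.add(fields[i] + ":" + token);
            }
            String prefix = flags[i] == Occur.MUST ? "+"
                          : flags[i] == Occur.MUST_NOT ? "-" : "";
            clauses.add(prefix + "(" + String.join(" ", terms) + ")");
        }
        return String.join(" ", clauses);
    }

    public static void main(String[] args) {
        String[] fields = {"f1", "f2"};
        Occur[] flags = {Occur.MUST, Occur.SHOULD};
        // prints +(f1:abc f1:def) (f2:abc f2:def)
        System.out.println(expand("abc def", fields, flags));
    }
}
```

Comparing this model's output with what the real parser prints for the same input is a simple way to see where the parser's behaviour differs from the assumption.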

Re: Any way to improve document fetching performance?

2018-08-27 Thread baris . kazar
Alex,- how big are those docs? Best regards On 8/27/18 10:09 AM, alex stark wrote: Hello experts, I am wondering is there any way to improve document fetching performance, it appears to me that visiting from store field is quite slow. I simply tested to use indexsearch.doc() to get 2000 do

Re: Any way to improve document fetching performance?

2018-08-27 Thread baris . kazar
yes, it should be less than a ms actually for those type of files. index and search on the same machine? no net latency in between? Best On 8/27/18 10:14 AM, alex stark wrote: quite small, just several simple short text store fields. The total index size is around 1 GB (2m doc). On Mo

Re: Any way to improve document fetching performance?

2018-08-27 Thread baris . kazar
can you post your query string? Best On 8/27/18 10:33 AM, alex stark wrote: In same machine, no net latency. When I reduce to 500 limit, it takes 20ms, which is also slower than I expected. btw, indexing is stopped. On Mon, 27 Aug 2018 22:17:41 +0800 wrote yes, it should be less

Re: Create Best Query For My Case

2018-08-27 Thread baris . kazar
Taha,- can you please simplify your case? You need to narrow down more. Best regards

Re: Create Best Query For My Case

2018-08-28 Thread baris . kazar
Are You using MultiFieldQueryParser? i see similar behavior for that parser. That is why i asked for your query string, lets look at Your query string and which parser You are using. Best regards On 8/28/18 8:02 AM, thturk wrote: Occur.MUST is working but problem for my case is when Occur.S

Lucene finding partial word matchings

2018-08-30 Thread baris . kazar
Hi,- how can i do this with Lucene (not Solr)? Best regards

Re: Lucene finding partial word matchings

2018-08-30 Thread baris . kazar
i can just add "*" at the end of the search string and it works ok. http://lucene.apache.org/core/3_1_0/queryparsersyntax.html Best On 8/30/18 10:01 AM, baris.ka...@oracle.com wrote: Hi,- how can i do this with Lucene (not Solr)? Best regards --

SynonymMap

2018-09-06 Thread Baris Kazar
Hi,- how does SynonymMap deal with repeated values? Best regards

SynonymGraphFilter

2018-09-10 Thread baris . kazar
https://lucene.apache.org/core/6_4_1/analyzers-common/org/apache/lucene/analysis/synonym/SynonymGraphFilter.html Does this mean i dont have to repeat it in the search analyzer when i do this at indexing time? Best regards

Re: SynonymGraphFilter

2018-09-10 Thread baris . kazar
Any examples on this? i think it would be nice if Javadocs had an example on this: However, if you use this during indexing, you must follow it with FlattenGraphFilter to squash tokens on top of one another like SynonymFilter, because the indexer can't directly consume a graph. To get fully c

Re: SynonymMap

2018-09-10 Thread baris . kazar
http://blog.mikemccandless.com On Thu, Sep 6, 2018

SynonymMap.Builder.add method

2018-09-10 Thread baris . kazar
i am trying to understand the add method here https://lucene.apache.org/core/6_4_1/analyzers-common/org/apache/lucene/analysis/synonym/SynonymMap.Builder.html public void add(CharsRef input, CharsRef output, boolean includeOrig) Add a phrase->phrase

Re: SynonymGraphFilter

2018-09-11 Thread baris . kazar
Any comments please? Thanks On 9/10/18 5:07 PM, baris.ka...@oracle.com wrote: Any examples on this? i think it would be nice if Javadocs had an example on this: However, if you use this during indexing, you must follow it with FlattenGraphFilter to squash tokens on top of one another like

Re: SynonymMap.Builder.add method

2018-09-11 Thread baris . kazar
Any comments please? Thanks On 9/10/18 5:21 PM, baris.ka...@oracle.com wrote: i am trying to understand the add method here https://lucene.apache.org/core/6_4_1/analyzers-common/org/apache/lucene/analysis/synonym/SynonymMap.Builder.html

Re: SynonymGraphFilter

2018-09-11 Thread baris . kazar
Mike,- Great article, thanks for that; and i was exactly thinking about reverse mapping when i was writing this question. i guess it would be nicer if Lucene added both mappings when one is requested, or had another parameter to activate this double mapping. My next question is: can a synonym be separ

Re: SynonymMap.Builder.add method

2018-09-11 Thread baris . kazar
Thanks Mike. On 9/11/18 1:38 PM, Michael McCandless wrote: That's correct. When the input sequence is seen during tokenization, the synonym (graph) filter will also insert the output tokens into the TokenStream, as if they "naturally" occurred. Mike McCandless https://urldefense.proofpoint.

Re: SynonymGraphFilter

2018-09-12 Thread baris . kazar
Any examples on the following note on the Javadocs at https://lucene.apache.org/core/6_4_1/analyzers-common/org/apache/lucene/analysis/synonym/SynonymGraphFilter.html Quoted from the above url: However, if you use this during indexing, you must follow it with FlattenGraphFilter to squash to

Re: SynonymGraphFilter

2018-09-12 Thread baris . kazar
So, the below statement suggests this? "To get fully correct positional queries when your synonym replacements are multiple tokens, you should instead apply synonyms using this TokenFilter at query time and translate the resulting graph to a TermAutomatonQuery e.g. using TokenStreamToTermAuto

Re: SynonymGraphFilter

2018-09-13 Thread baris . kazar
Thanks Michael. I think this clears my questions. Best regards On 9/12/18 8:23 PM, Michael Sokolov wrote: Usually one will either apply synonyms at index time or apply them at query time, but not both. I think the situation is that you will get most correct behavior, respecting synonym graph s

MultiPhraseQuery

2018-09-18 Thread baris . kazar
Hi,-  how does MultiPhraseQuery treat synonyms? is the following possible? ... (created index with synonyms and indexReader object has the index) IndexSearcher is = new IndexSearcher(indexReader); MultiPhraseQuery.Builder builder = new MultiPhraseQuery.Builder(); builder.add(new Term("body",

Re: MultiPhraseQuery

2018-09-18 Thread baris . kazar
Trying to implement the example on https://lucene.apache.org/core/6_6_1/core/org/apache/lucene/search/MultiPhraseQuery.html // A generalized version of PhraseQuery, with the possibility of adding more than one term at the same position that are treated as a disjunction (OR). To use this class

Re: MultiPhraseQuery

2018-09-18 Thread baris . kazar
Any suggestions please? Two main questions: - how do synonyms get utilized by MultiPhraseQuery? - how do we get second token "app" applied to the example on MultiPhraseQuery javadocs page? (and how do we get Terms[] array from Terms object?) Now three questions :) i wish the Javadocs has exam

Re: MultiPhraseQuery

2018-09-18 Thread baris . kazar
Erick,-  i think the reason why MultiPhraseQuery was created was synonyms as far as i understood. am i right? i want to have a BooleanQuery or MultiPhraseQuery (i cant decide between these two) with an index which considers synonyms already. One disadvantage of MultiPhraseQuery is that it need

Re: MultiPhraseQuery

2018-09-18 Thread baris . kazar
FuzzyQuery seems also not suitable for me. PrefixQuery can be one token only, right? Best On 9/18/18 5:23 PM, baris.ka...@oracle.com wrote: Erick,-  i think the reason why MultiPhraseQuery was created was synonyms as far as i understood. am i right? i want to have a BooleanQuery or MultiPh

Re: MultiPhraseQuery

2018-09-18 Thread baris . kazar
Ok, Mike, that was very helpful. Now, i think i should use BooleanQuery with PhraseQueries but will PhraseQuery be able to handle all synonyms- multi or single term? What is the best way for this: i have multiple tokens and i want to be able to do a cheap fuzzy search. Best regards On 9/18

MultiPhraseQuery or PhraseQuery to take the synonyms into account?

2018-09-20 Thread baris . kazar
Hi,- should i use MultiPhraseQuery or PhraseQuery to take synonyms into account? Best regards baris

An example for creating SynonymMap Object?

2018-10-15 Thread baris . kazar
Hi,- is there any example on Lucene domain for creating SynonymMap Object? Best regards

Re: An example for creating SynonymMap Object?

2018-10-15 Thread baris . kazar
Thanks Markus. i am trying to see whether i need to add abc-> 123 and also 123-> abc via builder.add. Best regards On 10/15/18 3:28 PM, Markus Jelsma wrote: Hello Baris, Check out the filter factory and the map parser for a more low level example: https://urldefense.proofpoint.com/v2/url

Re: Ignoring “de la” at index or search time

2019-02-25 Thread baris . kazar
Thanks Erick, that was very helpful. Now, i see what you meant at the beginning of this thread: stopwords are less of a concern. Now, may i ask the following related question? QueryParser parser = new QueryParser(columns[0], analyzer) ; Query query5 = parser.parse(q+"~"); i see the query

Re: Ignoring “de la” at index or search time

2019-02-25 Thread baris . kazar
Ok, found answer to this question: parser.setPhraseSlop(slopValue); Thanks On 2/25/19 11:43 AM, baris.ka...@oracle.com wrote: Thanks Erick, that was very helpful. Now, i see what you meant at the beginning of this thread: stopwords are less of a concern. Now, may i ask the following re

Re: Ignoring “de la” at index or search time

2019-03-01 Thread baris . kazar
this did not work, any suggestions please? QueryParser parser = new QueryParser(columns[0], analyzer) ; Query query5 = parser.parse(q+"~"); i cant set the slop value like parser.setPhraseSlop(slopValue); i still see the query printed as with value 2: Query5:: :~2 Best regards On 2/25/19

Lucene FuzzyQuery

2019-06-07 Thread baris . kazar
May i ask this question about FuzzyQuery? When i select BooleanClause.Occur.MUST for a FuzzyQuery i dont get any results back but with Occur.SHOULD i get some results, though. what is the meaning of Occur.MUST for FuzzyQuery? I want Lucene to give me best result from FuzzyQuery, i thought th

Re: Lucene FuzzyQuery

2019-06-07 Thread baris . kazar
Sure, i will try that: and lets mark this problem as problem#1. something else is also happening: lets mark this as problem#2. i make sure i specify a string with 1 edit away misspelled and that never gets hit but the word with correct spelling is in the index. Best regards On 6/7/19 2:02

Re: Lucene FuzzyQuery

2019-06-07 Thread baris . kazar
They differ by only 1 char: the misspelled word is 5 letters long and the correct word is 4 letters long: Mains vs Main (correct). FuzzyQuery Mains~2 cannot catch anything with Main in the first 74 hits. the 75th hit has Main word. Best regards On 6/7/19 2:44 PM, Atri Sharma wrote: i make sure i specify a string
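The distances involved can be checked by hand. The sketch below is a plain-Java Levenshtein distance with no Lucene dependency. It is a simplification: FuzzyQuery's own matching also counts transpositions by default, which this version does not. It shows that `mains` is one edit from `main`, well inside `~2`, while an un-lowercased `MAINS` is five edits from `main`. That second case assumes the index holds lowercased terms and the query term was not run through the analyzer.

```java
class EditDistance {
    // Classic dynamic-programming Levenshtein distance (insert, delete, substitute).
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev; // reuse the two rows
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(levenshtein("mains", "main")); // prints 1
        System.out.println(levenshtein("maink", "main")); // prints 1
        System.out.println(levenshtein("MAINS", "main")); // prints 5
    }
}
```

Since both `mains` and `maink` are a single edit away, edit distance alone does not explain why one matches and the other does not; the difference has to come from somewhere else in the indexing or query path.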

Re: Lucene FuzzyQuery

2019-06-07 Thread baris . kazar
For problem#1 the following fuzzyquery ALONE does not catch any results when i run it  alone in the BooleanQuery and with either SHOULD or MUST occurrence. such as: [contentDFLT:MAIN~2] For problem#1 again the query is as follows: [+contentDFLT:"street", ->PhraseQuery contentDFLT:"mains~2",

FuzzyQuery

2019-06-07 Thread Baris Kazar
Hi,- i cant get FuzzyQuery working for searching with a query like Mains~2 to find the word Main in a TextField. Any suggestions please?

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
How do i check how it is indexed? lowercase or uppercase? The only way now is by testing. i am using StandardAnalyzer. Best regards On 6/9/19 11:57 AM, Atri Sharma wrote: On Sun, Jun 9, 2019 at 8:53 PM Tomoko Uchida wrote: Hi, What analyzer do you use for the text field? Is the term "Main"

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
i am using StandardAnalyzer. Best regards On 6/9/19 11:22 AM, Tomoko Uchida wrote: Hi, What analyzer do you use for the text field? Is the term "Main" correctly indexed? On Sat, Jun 8, 2019 at 9:13, Baris Kazar wrote: Hi,- i cant get FuzzyQuery working for searching with a query like Mains

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
i dont know how to use FuzzyQuery with QueryParser but probably You are suggesting QueryParser parser = new QueryParser(field, analyzer) ; Query query = parser.parse("MAINS~2"); booleanQuery.add(query, BooleanClause.Occur.SHOULD); am i right? Best regards On 6/10/19 10:47 AM, Atri Sharma wro

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
why does the second set not work at all? it is indexed as TextField like street="..." city="..." etc. Best regards On 6/10/19 11:23 AM, baris.ka...@oracle.com wrote: i dont know how to use FuzzyQuery with QueryParser but probably You are suggesting QueryParser parser = new QueryParser(fiel

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
booleanQuery.add(Utils.createPhraseQuery(phraseAnalyzer, field, "city=\"NASHUA\""), BooleanClause.Occur.MUST);     booleanQuery.add(Utils.createPhraseQuery(phraseAnalyzer, field, "region=\"NEW HAMPSHIRE\""), BooleanClause.Occur.MUST);     booleanQuery.add(Utils.createPhraseQuery(phraseAna

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
[+contentDFLT:"city nashua", +contentDFLT:"region new-hampshire", +contentDFLT:"country united states", contentDFLT:street contentDFLT:mains] QueryParser chops it into two pieces from parser.parse("street=\"MAINS\""); Index has a TextField named contentDFLT with the following data : street="MAIN

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
Somehow " is causing an issue as this should return street with MAIN: [contentDFLT:street="MAINS"~2, +contentDFLT:"city nashua", +contentDFLT:"region new-hampshire", +contentDFLT:"country united states"] Best regards On 6/10/19 2:24 PM, baris.ka...@oracle.com wrote: [+contentDFLT:"city nashu

Re: FuzzyQuery

2019-06-11 Thread baris . kazar
i can say that quotes are not the issue with index as it still results in same results with quotes or without quotes. i am starting to feel that this might be a bug maybe?? Best regards On 6/10/19 2:46 PM, baris.ka...@oracle.com wrote: Somehow " is causing an issue as this should return stree

Re: FuzzyQuery

2019-06-12 Thread baris . kazar
Tomoko,-  Thank You for Your suggestions. i am trying to understand it and i thought i did :) but it does not work with FuzzyQuery when i used with a *single* large TextField like street=...value... city=...value... region=...value... country=...value... (with or without quotes for the value

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread baris . kazar
Hi again,- this is really interesting and i hope i am missing something. The index lowercases all entries so case sensitivity is not an issue i think. Case #1: org.apache.lucene.queryparser.classic.QueryParser parser = new org.apache.lucene.queryparser.classic.QueryParser(field, phraseAnalyzer)

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread baris . kazar
Ok, i think only this very specific "mains" has an issue. all i knew about Lucene was fine :) Great... i have one more question: which one is advised to use: FuzzyQuery or QueryParser with ~ appended to the search string? The second one will go through analyzer and make search string lower

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Tomoko,- That is strange indeed. Something is wrong when i use mains, but maink, mainl, mainr, mainq, maint all work ok: any consonant at the end except s works in this case. Case #3 had +contentDFLT:mains~2 but not +contentDFLT:"mains~2". i am using fuzzy query with ~ from Query.builder and t

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming. Whenever I see a problem with something ending in “s” (or “er” or “ing” or….) my first suspect is that stemming is turned on. In that cas

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
does it consider it as like plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming. When

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
However, the index does not have MAINS but MAIN for the expected entry. Best regards On 6/13/19 10:33 AM, baris.ka...@oracle.com wrote: does it consider it as like plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could Y

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
i see, i am using an older version 6.6 and we should switch to Your 8.1 version or at least 7.X. Tomoko i think i understood You meant MAIN NASHUA for the string :) Again i really appreciate all answers. How do we disable or enable stemming while indexing? :) another question. Best regar

Re: Best fuzzy match on multiple terms

2019-06-13 Thread baris . kazar
i would suggest trying (indexing and searching) without the ' characters and see if You can find it first. Thanks On 6/13/19 11:25 AM, Matthias Müller wrote: I am currently matching botanic names (with possible mis-spellings) against an indexed referenced list with Lucene. After quick progress in the

Re: Best fuzzy match on multiple terms

2019-06-14 Thread baris . kazar
These are great suggestions, i was going to suggest explain plan of query, too. i really wonder in Your case why 'Rozi' entry does not get higher score. Is there any effect from " ' " chars? In my case i have sort of reverse situation: my query is maink~2 (mains was a special case where i st

Re: FuzzyQuery- why is it ignored?

2019-06-15 Thread Baris Kazar
Hello,- Erick explained how to disable stemming in Solr but i am using Lucene purely. i am also researching how to disable it in Lucene but if You have instructions how to do so already i appreciate if You could share here. Best regards - Original Message - From: baris.ka...@oracle.com T

Re: FuzzyQuery- why is it ignored?

2019-06-16 Thread Baris Kazar
16, 2019 4:39:29 AM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery- why is it ignored? Hi, you said you are using standard analyzer. If so, you are not using any stemmer at all (please see the analyzer's Javadocs). On Sun, Jun 16, 2019 at 11:43, Baris Kazar wrote: > > Hello,- > Erick ex

Re: FuzzyQuery- why is it ignored?

2019-06-19 Thread baris . kazar
i still cannot find the reason why MAINS cannot be found by the Lucene index with StandardAnalyzer. MAINZ, MAINK, MAINT all work ok. Any suggestions please? Best regards On 6/16/19 9:38 AM, Baris Kazar wrote: Tomoko,- Yes, i noticed that last nite when i was researching it and

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Baris Kazar
Tomoko,- may i ask if You could try with these few more data indexed too? "KEHOE NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES" "CHESTNUT NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES" "JEFFERSON NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES" "NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Baris Kazar
esults/conditions are provided. Sun, Jun 23, 2019, 10:40 Baris Kazar : > > Tomoko,- > i will surely try on my env version 8.1 > but if You could also try then both runs will > make sure it is bug. > No problems at all. i will test it. > I need to ask one thing when you ran the ex

A possible Java exception message fix

2019-06-24 Thread baris . kazar
Index created with Lucene 6.6 and according to that, this error message might need an update: -> This version of Lucene only supports indexes created with release 7.0 and later. The first part of the error message is consistent with the error: -> 6 (needs to be between 7 and 9). Hope this helps

Re: A possible Java exception message fix

2019-06-24 Thread baris . kazar
Ok i forgot to mention below that i was trying to run 6.6 index with Lucene 8.1.1. Best regards On 6/24/19 2:03 PM, baris.ka...@oracle.com wrote: Index created with Lucene 6.6 and i tried running the same index with Lucene 8.1 and according to that, this error message might need update:

Re: A possible Java exception message fix

2019-06-24 Thread baris . kazar
I already explained on the message line and in the email body: the exception message needs a fix and i explained in detail below. Thanks On 6/24/19 2:06 PM, Erick Erickson wrote: What are you asking here? Indeed, Lucene 8 (and therefore Solr) will not open an index that has ever been touched by

Re: A possible Java exception message fix

2019-06-24 Thread baris . kazar
This exception message: This version of Lucene only supports indexes created with release 6.0 and later. needs to change to This version of Lucene only supports indexes created with release 7.0 and later. Thanks On 6/24/19 4:30 PM, baris.ka...@oracle.com wrote: I already explained on t

StanardFilter Question : https://issues.apache.org/jira/browse/LUCENE-8356

2019-06-24 Thread baris . kazar
According to this jira ticket, where else is StandardFilter included in Lucene 8.1.1? and why is it a no-op now in Lucene 8.1.1? I wish the tickets were a bit more explicit and suggest what to use instead for deprecated versions like in version 7.5.0 or why it became no-op in version 8.1.1?

Re: FuzzyQuery- why is it ignored?

2019-06-25 Thread baris . kazar
asked about this in another thread. On 6/23/19 12:29 AM, Baris Kazar wrote: oops sorry about this, i also automatically did reply to and assumed it went to the list. i totally agree about your recommendations and totally agree it should have been sent to the forum emailing list. please

Re: StanardFilter Question : https://issues.apache.org/jira/browse/LUCENE-8356

2019-06-25 Thread baris . kazar
Hi,-  do You mean there is a backward compatibility factory in Lucene for these kinds of cases? i think it can be fixed like this,  In other words is the following first line redundant then? TokenStream filter = new StandardFilter(tokenizer); -> redundant (tokenizer is actually a StandardT

Re: StanardFilter Question : https://issues.apache.org/jira/browse/LUCENE-8356

2019-06-25 Thread baris . kazar
Corrected a typo below in the new code. Best regards On 6/25/19 5:01 PM, baris.ka...@oracle.com wrote: Hi,-  do You mean there is a backward compatibility factory in Lucene for these kinds of cases? i think it can be fixed like this,  In other words is the following first line redundant t

Scoring in Lucene 6.6.0, 7.7.2, 8.1

2019-06-25 Thread baris . kazar
Hi,-  i really want to know why the scoring works this way: search String is either MAINO or MAINS: MAIN appears as the 276th entry in the results. NEW HAMPSHIRE in results: city="NASHUA" municipality="HILLSBOROUGH" region="NEW HAMPSHIRE" country="UNITED STATES" in the 0 th result NEW HAMPSHI

Re: Scoring in Lucene 6.6.0, 7.7.2, 8.1

2019-06-26 Thread baris . kazar
Yes, i know that feature but so far it did not help me much but i am still looking into that. Thanks On 6/26/19 2:41 AM, Adrien Grand wrote: You can use IndexSearcher#explain to see how scores are computed. On Wed, Jun 26, 2019 at 12:48 AM wrote: Hi,- i really want to know why the scor
