date:20140115

JapaneseAnalyser filter ordering

2014-01-15 Thread Trejkaz

The current ordering of JapaneseAnalyser's token filters is as follows:

1. JapaneseBaseFormFilter
2. JapanesePartOfSpeechStopFilter
3. CJKWidthFilter (similar to NormaliseFilter)
4. StopFilter
5. JapaneseKatakanaStemFilter
6. LowerCaseFilter

Our existing support for E

ANNOUNCE: ApacheCon NA 2014 Travel Assistance Applications now open!

2014-01-15 Thread Chris Hostetter

(Note: cross-posted to various Lucene user lists. If you have replies, please keep them on general@lucene, but please note that specific questions should be addressed to travel-assista...@apache.org) - - - Forwarded Announcement - - - The Travel Assistance Committee (TAC) are pleased

Re: Length of the filed does not affect the doc score accurately for chinese analyzer(SmartChineseAnalyzer)

2014-01-15 Thread Erick Erickson

The lengths of fields are encoded and lose some precision, so I suspect the field lengths calculated for the two documents are the same after encoding. Adding &debug=all to the query will show you whether this is the case. Best, Erick On Wed, Jan 15, 2014 at 3:39 AM, andy wrote: > Hi guys, > >
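To illustrate how two different field lengths can collapse to the same stored value, here is a minimal sketch. It assumes the default TF-IDF style length norm of 1/sqrt(number of terms) with no boost, stored in a single byte. The encode and decode methods are a standalone re-implementation of that one-byte float encoding, written to mirror what Lucene's SmallFloat class is understood to do; the class name NormPrecisionSketch and its method names are hypothetical, and none of this is taken from the thread itself.

```java
// Sketch only: a standalone re-implementation of a one-byte float encoding,
// modelled on Lucene's SmallFloat. It does not call Lucene itself.
class NormPrecisionSketch {

    // Assumption: length norm = 1 / sqrt(number of terms in the field), no boost.
    static float lengthNorm(int numTerms) {
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    // Keep only the top bits of the float and rebase the exponent into one byte.
    static byte encode(float f) {
        int bits = Float.floatToRawIntBits(f);
        int small = bits >> 21;            // drop the low 21 mantissa bits
        int zero = (63 - 15) << 3;         // exponent offset used by the encoding
        if (small <= zero) {
            return (bits <= 0) ? (byte) 0 : (byte) 1;   // underflow
        }
        if (small >= zero + 0x100) {
            return (byte) -1;                           // overflow
        }
        return (byte) (small - zero);
    }

    // Inverse of encode: expand the byte back into a (less precise) float.
    static float decode(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << 21;
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }

    public static void main(String[] args) {
        // Print the exact norm, the stored byte, and the decoded norm per length.
        for (int len = 7; len <= 17; len++) {
            float norm = lengthNorm(len);
            byte stored = encode(norm);
            System.out.printf("terms=%d exact=%.4f byte=%d decoded=%.4f%n",
                    len, norm, stored, decode(stored));
        }
    }
}
```

Under these assumptions, fields of 8 to 10 terms share one stored byte (decoded as 0.3125), and fields of 11 to 16 terms all decode to 0.25, so documents whose lengths fall in the same bucket receive the same length normalization.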

Length of the filed does not affect the doc score accurately for chinese analyzer(SmartChineseAnalyzer)

2014-01-15 Thread andy

Hi guys, As the subject says, it seems that the length of the field does not accurately affect the doc score for the Chinese analyzer in my source code. Index source code: private static Directory DIRECTORY; @BeforeClass public static void before() throws IOException { DIRECTORY = new RAMDire