The current ordering of JapaneseAnalyzer's token filters is as follows:
1. JapaneseBaseFormFilter
2. JapanesePartOfSpeechStopFilter
3. CJKWidthFilter (similar to NormaliseFilter)
4. StopFilter
5. JapaneseKatakanaStemFilter
6. LowerCaseFilter
Our existing support for E
(Note: cross-posted to various Lucene user lists. If you have replies,
please keep them on general@lucene, but please note that specific
questions should be addressed to travel-assista...@apache.org)
- - - Forwarded Announcement - - -
The Travel Assistance Committee (TAC) are pleased
the lengths of fields are encoded and lose some precision. So
I suspect the field lengths calculated for the two documents
are the same after encoding.
Adding &debug=all to the query will show you if this is the case.
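To illustrate the precision loss, here is a minimal standalone sketch. It assumes the Lucene 4.x default similarity, where the length norm is 1/sqrt(numTerms) and is stored in a single byte. The encode and decode methods are written from memory of SmallFloat.floatToByte315 and byte315ToFloat, so treat them as an approximation rather than the library source. The class name NormPrecisionSketch and the helper storedNorm are made up for this example.

```java
class NormPrecisionSketch {
    // Encode a float into one byte.
    // Assumption: this mirrors Lucene's SmallFloat.floatToByte315.
    static byte encode(float f) {
        int bits = Float.floatToRawIntBits(f);
        int small = bits >> (24 - 3);      // drop the low-order mantissa bits
        int zero = (63 - 15) << 3;         // offset of the smallest exponent kept
        if (small <= zero) {
            // zero or negative input maps to 0, positive underflow maps to 1
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (small >= zero + 0x100) {
            return -1;                     // overflow: largest encodable value
        }
        return (byte) (small - zero);
    }

    // Decode the byte back to a float.
    // Assumption: this mirrors Lucene's SmallFloat.byte315ToFloat.
    static float decode(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = (b & 0xff) << (24 - 3);
        bits += (63 - 15) << 24;
        return Float.intBitsToFloat(bits);
    }

    // Hypothetical helper: the norm as it would be read back from the index
    // for a field with numTerms terms and no boost.
    static float storedNorm(int numTerms) {
        float norm = (float) (1.0 / Math.sqrt(numTerms));
        return decode(encode(norm));
    }

    public static void main(String[] args) {
        // Print the round-tripped norm for a range of field lengths.
        for (int n = 1; n <= 20; n++) {
            System.out.println(n + " terms -> stored norm " + storedNorm(n));
        }
    }
}
```

Under these assumptions, fields of 11 to 16 terms all round-trip to the same stored norm, so two documents whose field lengths fall in that range would not differ in score on length alone.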
Best
Erick
On Wed, Jan 15, 2014 at 3:39 AM, andy wrote:
> Hi guys,
>
>
Hi guys,
As the subject says, it seems that the length of the field does not accurately
affect the document score with the Chinese analyzer in my source code.
Index source code:
private static Directory DIRECTORY;
@BeforeClass
public static void before() throws IOException {
DIRECTORY = new RAMDire