在google里搜一下中文分词,出车东的包外,应该还有很多了,如果你发现有更好分词,更高效率的,也推荐一份啊。
--
From: "kai.hu" <[EMAIL PROTECTED]>
Sent: Sunday, May 04, 2008 4:20 PM
To:
Subject: Re: Need addtional info for Field(希望看得懂中文的朋友帮我出出主意)
你只要索引并分词“下午去开会”就行了,把对应的时间保存进去。
如
你只要索引并分词“下午去开会”就行了,把对应的时间保存进去。
如document.add(new Field("sub","下午去开会",Field.Store.YES,Field.Index.TOKENIZED));
document.add(new
Field("time","01:02:02",Field.Store.YES,Field.Index.UN_TOKENIZED));
到时候搜索出的单个document里就包含这两个Field了。
only index and tokenized "下午去开会",and store the time with this sub.
i got a problem yesterday,
java.lang.NullPointerException
at
org.apache.lucene.index.DocumentsWriter$ThreadState.init(DocumentsWriter.java:751)
at
org.apache.lucene.index.DocumentsWriter.getThreadState(DocumentsWriter.java:2391)
at
org.apache.lucene.index.DocumentsWriter.updateDocument(Documen