yzersTokenizersTokenFilters#solr.WordDelimit
erFilterFactory
Uwe
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Message-
> From: stephen.warner.tho...@gmail.com
> [mailto:stephen.warner.tho...@gmail.com] On Be
>> -Original Message-
>> From: stephen.warner.tho...@gmail.com
>> [mailto:stephen.warner.tho...@gmail.com] On Behalf Of Stephen Thomas
>> Sent: Tuesday, November 29, 2011 5:20 PM
>> To: java-user@lucene.apache.org
>> Subject: Custom Filter for Splitting CamelCase?
e.org
> Subject: Custom Filter for Splitting CamelCase?
>
> List,
>
> I have written my own CustomAnalyzer, as follows:
>
> public TokenStream tokenStream(String fieldName, Reader reader) {
>
> // TODO: add calls to RemovePuncation, and SplitIdentifiers
>
List,
I have written my own CustomAnalyzer, as follows:
public TokenStream tokenStream(String fieldName, Reader reader) {
// TODO: add calls to RemovePuncation, and SplitIdentifiers here
// First, convert to lower case
TokenStream
I figured this one out... it was due to a mistake in my code... sorry
for trouble.
-siraj
On 3/25/2010 5:48 AM, Ian Lea wrote:
Could this maybe have something to do with per-segment readers, as
mentioned in recent message from Daniel?
Posting lucene version and the full stack trace dump is al
Could this maybe have something to do with per-segment readers, as
mentioned in recent message from Daniel?
Posting lucene version and the full stack trace dump is always a good idea.
--
Ian.
On Wed, Mar 24, 2010 at 6:56 PM, Siraj Haider wrote:
> Hello there,
> I am getting exception when run
Hello there,
I am getting exception when running queries with new getDocIdSet() in my
customer filter. Following is the code for my getDocIdSet() function:
/public DocIdSet getDocIdSet(IndexReader reader) throws IOException {
OpenBitSet bitSet = new OpenBitSet(reader.maxDoc());
for (in
Never mind, I think I got it.
-Original Message-
From: OBender [mailto:osya_ben...@hotmail.com]
Sent: Monday, July 20, 2009 4:42 PM
To: java-user@lucene.apache.org
Subject: RE: question on custom filter
No, it reversed in the e-mail. Funny though, when I insert it in to the Excel
it
t (left to right, e.g., EREV left most then TOV). Should
UTF-8 CSV file preserve the natural (language specific) order of words?
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:49 PM
To: java-user@lucene.apache.org
Subject: Re: question on cus
u
:) ?
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:34 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, I think your input is incorrect. The hebrew text you pasted
in your example appears incorrect. Its gonna be h
From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 3:18 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, based on your previous comments (that you see text displayed
> in the wrong order), I again recommend that you en
I've checked, and it appears to be enabled.
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:18 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, based on your previous comments (that you see text display
onday, July 20, 2009 3:03 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, i ran your code and it did what I expected (but not what you pasted):
>
> First token is: (טוֹב,0,4)
> Second token is: (עֶרֶב,5,10)
>
> I also loaded up your Simp
3:03 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, i ran your code and it did what I expected (but not what you pasted):
First token is: (טוֹב,0,4)
Second token is: (עֶרֶב,5,10)
I also loaded up your SimpleWhitespaceAnalyzer in Luke, with the same results.
On M
al String fieldName, final Reader
>> reader )
>> {
>> TokenStream ts = new WhitespaceTokenizer( reader );
>> ts = new XFilter( ts );
>>
>> return ts;
>> }
>> }
>>
>> -Original M
riginal Message-
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 2:26 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, I think something in your environment / display environment
> might be causing some confu
che.org
Subject: Re: question on custom filter
Obender, I think something in your environment / display environment
might be causing some confusion.
Are you using microsoft windows? If so, please verify that support for
right-to-left languages is enabled [control panel/regional and
language options].
ser@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, This is not true.
> the text you pasted is the following in unicode:
>
> \N{HEBREW LETTER TET}
> \N{HEBREW LETTER VAV}
> \N{HEBREW POINT HOLAM}
> \N{HEBREW LETTER BET}
> \N{SPACE}
> \N{HEBREW LETTER
: Re: question on custom filter
Obender, This is not true.
the text you pasted is the following in unicode:
\N{HEBREW LETTER TET}
\N{HEBREW LETTER VAV}
\N{HEBREW POINT HOLAM}
\N{HEBREW LETTER BET}
\N{SPACE}
\N{HEBREW LETTER AYIN}
\N{HEBREW POINT SEGOL}
\N{HEBREW LETTER RESH}
\N{HEBREW POINT SEGOL
Hold on a second, the phrase that you included link to is not in the correct
order of words!
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 2:07 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, This is not
ly 20, 2009 2:07 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, This is not true.
the text you pasted is the following in unicode:
\N{HEBREW LETTER TET}
\N{HEBREW LETTER VAV}
\N{HEBREW POINT HOLAM}
\N{HEBREW LETTER BET}
\N{SPACE}
\N{HEBREW LETTER AYIN}
\N{HEBREW
s and indexes).
>
> Am I missing something?
>
> -Original Message-
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 1:43 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, I don't think its as dif
ssage-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 1:43 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, I don't think its as difficult as you think. Your filter does
not need to be aware of this issue at all.
In unicode, rig
Obender, I don't think its as difficult as you think. Your filter does
not need to be aware of this issue at all.
In unicode, right-to-left languages are encoded in the data in logical order.
The rendering system is what converts it to display in right-to-left
for RTL languages.
For example in Ar
Hi All!
Let say I have a filter that produces new tokens based on the original ones.
How bad will it be if my filter sets the start of each token to 0 and end to
the length of a token?
An example (based on the phrase "How are you?":
Original token:
[you?] (8,12)
New tokens:
[you]
: Problem with Custom Filter
I think you're only setting one bit in your filter.
You're docs array is only one cell long, and your termDocs.read reads up to
the length of docs (exactly one in this case) entries. So, you're getting
only one doc ID. And setting it. Even if you made yo
07, Paul Lynch <[EMAIL PROTECTED]> wrote:
Hi,
I am going mad trying to find out what I am doing wrong with my custom
filter implementation (almost an exact copy of SpecialsFilter from LIA). I
have put together a quick sample to illustrate my problem, if some kind soul
has 2 minutes to take a
Hi,
I am going mad trying to find out what I am doing wrong with my custom filter
implementation (almost an exact copy of SpecialsFilter from LIA). I have put
together a quick sample to illustrate my problem, if some kind soul has 2
minutes to take a quick look and tell me where I am being so
2006-12-08 fre 15:41 -0800, Chris Hostetter:
> : Is it necessary to check the deleted-status of documents that the filter
> : includes (and never actually include deleted documents), or is this done
> ...
> : I'm trying to implementent an inverted version of a filter, simply by
> : flipping a
: Is it necessary to check the deleted-status of documents that the filter
: includes (and never actually include deleted documents), or is this done
...
: I'm trying to implementent an inverted version of a filter, simply by
: flipping all the bits in the BitSet, after the filter has fini
Hi,
I have a question that is probably easy to answer for many of you. I'm
using some custom Filters with Lucene, mostly imlemented by using
TermEnum/TermDocs and checking some condition.
Is it necessary to check the deleted-status of documents that the filter
includes (and never actually include
Good call, Chris.I followed the BitSet comparison route and found that
the custom filter was working exactly as it should, but *I* wasn't passing
it correct data. Rookie mistake.
Doh! I hate it when that happens.
-- j
On 9/13/05, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
>
e) that they have greater-than-zero
scores.
b) Is it possible that your filter isn't doing anything all, ie: is there
> any execution path that may return a BitSet in which every bit is set?
It's either doing nothing or not working at all. This seems to be the likely
culprit. St
your problem.
: Date: Tue, 13 Sep 2005 17:22:49 -0700
: From: Jeff Rodenburg <[EMAIL PROTECTED]>
: Reply-To: java-user@lucene.apache.org, [EMAIL PROTECTED]
: To: Chris Hostetter <[EMAIL PROTECTED]>
: Cc: java-user@lucene.apache.org
: Subject: Re: Hits issue or custom filter issue?
:
:
AIL PROTECTED]
: To: Chris Hostetter <[EMAIL PROTECTED]>
: Cc: java-user@lucene.apache.org
: Subject: Re: Hits issue or custom filter issue?
:
: Might be the same issue, haven't been able to determine during a
: step-through on the code exec.
: You're right, no need to add a new Filtered
Might be the same issue, haven't been able to determine during a
step-through on the code exec.
You're right, no need to add a new FilteredQuery to the statement, just a
search on combinedQuery with a new myCustomFilter.
Unfortunately, no joy; same response.
-- j
On 9/13/05, Chris Hostetter <[E
: Hits h1 = oMultiSearcher.Search(new FilteredQuery(combinedQuery, new
: myCustomFilter(1)));
: Hits h2 = oMultiSearcher.Search(new FilteredQuery(combinedQuery, new
: myCustomFilter(2)));
...do you get the same results if you use...
Hits h1 = oMultiSearcher.search(combinedQuery, myCustomFilte
nitial
thought is the problem lies in the custom filter I've created.
myCustomFilter extends Filter, and I'm following the BitSet comparitive
example as found in the LIA book. I've done nothing in myCustomFilter
regarding caching.
I'm doubting this is a bug, but rather something I've overlooked.
thanks,
jeff r.
38 matches
Mail list logo