Re: Doing the lucene remove character \n (break line)

Erick Erickson Sun, 23 Nov 2008 17:50:19 -0800

What I'd do is make my own filter, probably one based upon one of
the pre-existing ones and modify the call to nextToken, examine that
token, and if it ends in a hyphen get the next token and return the
concatenation of the two. I don't believe that there's a pre-existing
filter that does this, but you might want to check because I haven't
looked at them an a while.


Best
Erick

On Sun, Nov 23, 2008 at 4:40 PM, farnetani <[EMAIL PROTECTED]> wrote:

>
> I need to do lucene find the sentence:
> ARLEI FERREIRA FARNETANI JUNIOR
> [arlei] [ferreira] [farnetani] [junior]    (1)
>
> and too:
>
> ARLEI FERREIRA FAR-   <break line>
> NETANI JUNIOR
>
> I'm using the Brazilian Analyzer, but the result is:
> [ARLEI] [FERREIRA] [FAR] [NETANI] [JUNIOR]
>
> I have to do that the lucene result:
> [ARLEI] [FERREIRA] [FARNETANI] [JUNIOR] equals the sentence (1)
>
> So I have to do that lucene remove "-" hyphen and the break line (\n).
>
> To remove character hyphen (-) I got, but remove the break line no!
>
> How do I do???
> --
> View this message in context:
> http://www.nabble.com/Doing-the-lucene-remove-character-%5Cn-%28break-line%29-tp20650540p20650540.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>

Re: Doing the lucene remove character \n (break line)

Reply via email to