Quoting Aki Tuomi <[email protected]>:
On 06.04.2017 14:58, [email protected] wrote:
Hi,
I'm trying to resolve a few problems with indexing 'From' headers using
FTS/Solr. I ran tcpdump on the communication between Dovecot and
Jetty/Solr and noticed that 'From' headers that also include the
sender's name are double-escaped. This is what Dovecot was sending to
Solr:
</field><field name="from">Name Surname
&amp;lt;[email protected]&amp;gt;</field></doc></add>
As you can see, the characters < and > were escaped to &lt; and &gt;, which
were then escaped again to &amp;lt; and &amp;gt;. This causes problems
when indexing the whole e-mail address, because Solr sees it as
'&lt;[email protected]&gt;'.
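To illustrate the effect described above, here is a minimal Python sketch of
double escaping. It is not Dovecot's code; it only assumes that the header
value is XML-escaped twice before it goes on the wire and that Solr's XML
parser decodes one level. The address user@example.org is a placeholder.

```python
from xml.sax.saxutils import escape, unescape

# Placeholder header value; the real address is not shown in this thread.
header = "Name Surname <user@example.org>"

# Correct behaviour: escape once before embedding the value in XML.
once = escape(header)

# The behaviour described above: the already escaped value is escaped again.
twice = escape(once)

# An XML parser decodes exactly one level, so the indexed text still
# contains the literal entities instead of the angle brackets.
indexed = unescape(twice)

print(once)     # Name Surname &lt;user@example.org&gt;
print(twice)    # Name Surname &amp;lt;user@example.org&amp;gt;
print(indexed)  # Name Surname &lt;user@example.org&gt;
```

With a single escape, the parser would hand Solr the original header with
real < and > characters.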
I spent hours trying to figure out why I was able to search on every part
of an e-mail address, while searching for the full, exact e-mail address
succeeded ONLY for messages that don't include the sender's name
in the 'From' header. Once I found this bug, the following filters fixed all
the search problems:
<filter class="solr.PatternReplaceFilterFactory" pattern="&amp;lt;"
replacement=""/>
<filter class="solr.PatternReplaceFilterFactory" pattern="&amp;gt;"
replacement=""/>
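As a rough picture of what these two filters do to a token, here is a small
Python sketch. It only mimics the replacement with a regular expression; the
function name is made up for illustration, and it assumes the tokenizer hands
the address over as one token with the literal entity text attached.

```python
import re

def strip_escaped_brackets(token: str) -> str:
    """Remove every literal '&lt;' and '&gt;' from a token (hypothetical helper)."""
    return re.sub(r"&lt;|&gt;", "", token)

# Token as Solr would see it after the double escaping (placeholder address).
token = "&lt;user@example.org&gt;"
print(strip_escaped_brackets(token))  # user@example.org
```

After the replacement the indexed token equals the bare address, which is why
an exact-address search starts matching again.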
I hope that this bug will at least get fixed. Thank
you.
azur
Hi!
Which dovecot version was this?
Aki
Sorry, I forgot to mention it: 2.2.27, Debian Jessie (backports), 64-bit.