Maciej Lizewski created SOLR-5276:
-------------------------------------
Summary: highlighter working using stemmed tokens from another
field and text from another
Key: SOLR-5276
URL: https://issues.apache.org/jira/browse/SOLR-5276
Project: Solr
Issue Type: Improvement
Components: highlighter
Reporter: Maciej Lizewski
Priority: Minor
The case is like this:
I have 'content' field with content extracted with tika and several fields for
language versions (like content_pl, content_en, content_es, content_ru, etc).
I also use custom langid component which can copy 'content' to serveral
content_* fields if it detects more than one language (so those parts are
properly stemmed in every language present in text).
Now to use highlighter in such scenario I need to store all those language
fields even if their contents is always same as the one in 'content' field.
Would be nice if I could configure language specific fields to be not stored,
and configure highlighter to take tokens positions from those fields and apply
them to text in 'content' field...
In other words - to say: take tokens from 'content_pl', and prepare highlight
based on text in 'content' field.
It could be administrator responsibility to guarantee that mapped fields have
same content.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]