[
https://issues.apache.org/jira/browse/SOLR-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13694586#comment-13694586
]
Erlend Garåsen commented on SOLR-4565:
--------------------------------------
{quote}
Here's a patch showing what i mean...
{quote}
+1
I can create another patch including these changes.
{quote}
also some of the endings should be reviewed, because tests didnt pass.
i noticed -heten was configured for Nynorsk-only, but its expected to be
removed according to the nb_light.txt test file.
{quote}
The tests pass. -heten is handled correctly if you take a look in my first
patch. -heten should only be configured for Bokmål, not Nynorsk:
{code}
+ (endsWith(s, len, "heten") &&
+ useBokmaal(variant)) || // general ending (hemmelig-heten ->
hemmelig)
{code}
The equivalent for this ending using Nynorsk is "-heita".
My summer vacation starts tomorrow, so it might take a couple of weeks till I
have another patch ready - unless I get some time to fulfill this task tomorrow.
> Extend NorwegianMinimalStemFilter to handle "nynorsk"
> -----------------------------------------------------
>
> Key: SOLR-4565
> URL: https://issues.apache.org/jira/browse/SOLR-4565
> Project: Solr
> Issue Type: Improvement
> Components: Schema and Analysis
> Reporter: Jan Høydahl
> Attachments: SOLR-4565.patch, SOLR-4565.patch
>
>
> Norway has two official languages, both called "Norwegian", namely Bokmål
> (nb_NO) and Nynorsk (nn_NO).
> The NorwegianMinimalStemFilter and NorwegianLightStemFilter today only works
> with the largest of the two, namely Bokmål.
> Propose to incorporate "nn" support through a new "vaiant" config option:
> * variant="nb" or not configured -> Bokmål as today
> * variant="nn" -> Nynorsk only
> * variant="no" -> Remove stems for both nb and nn
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]