[ https://issues.apache.org/jira/browse/SOLR-17346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17866007#comment-17866007 ]
ASF subversion and git services commented on SOLR-17346: -------------------------------------------------------- Commit 991e76171e489e5f655d2dda7b0cab40177e5e57 in solr's branch refs/heads/pr/2550 from Alastair Porter [ https://gitbox.apache.org/repos/asf?p=solr.git;h=991e76171e4 ] SOLR-17346: Synchronise stopwords from snowball with those in lucene (#2533) > Synchronise default configset stopwords to the same list as lucene > ------------------------------------------------------------------ > > Key: SOLR-17346 > URL: https://issues.apache.org/jira/browse/SOLR-17346 > Project: Solr > Issue Type: Improvement > Security Level: Public(Default Security Level. Issues are Public) > Reporter: Alastair Porter > Priority: Trivial > Time Spent: 10m > Remaining Estimate: 0h > > Solr's default configset comes with a collection of sample stopwords from the > snowball project in solr/server/solr/configsets/_default/conf/lang > (https://github.com/apache/solr/tree/a42c605fb916439222a086356f368f02cf80304a/solr/server/solr/configsets/_default/conf/lang) > There is a similar list of stopwords in the lucene repository, however these > have been updated to a more recent list of snowball > ([https://github.com/apache/lucene/tree/main/lucene/analysis/common/src/resources/org/apache/lucene/analysis/snowball)] > Specifically, the most recent list of stopwords for the french language has > removed a number of words which are homonyms of other useful words which > shouldn't be skipped. > In a discussion on the solr-users mailing list it was agreed that it would be > a good idea to sync the list of files in solr with the ones in lucene. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org For additional commands, e-mail: issues-h...@solr.apache.org