[ 
https://issues.apache.org/jira/browse/SOLR-15779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17441231#comment-17441231
 ] 

Walter Underwood commented on SOLR-15779:
-----------------------------------------

The stopword filter should be removed from all example configs.

At Netflix in 2007, these movie titles were converted to empty strings by 
stopword removal.
 * [Being There|http://www.netflix.com/Movie/Being_There/60004552] (this is the first one I noticed)
 * [To Be and To Have (Être et Avoir)|http://www.netflix.com/Movie/To_Be_and_To_Have/60031224]
 * [To Have and Have Not|http://www.netflix.com/Movie/To_Have_and_Have_Not/60027940]
 * [Once and Again|http://www.netflix.com/Movie/Once_and_Again_Season_1/60030531]
 * [To Be or Not To Be (1942)|http://www.netflix.com/Movie/To_Be_or_Not_To_Be/60011755] (OK, it isn’t just a quote from Hamlet)
 * [To Be or Not To Be (1983)|http://www.netflix.com/Movie/To_Be_or_Not_To_Be/70046292]
 * [Now and Then, Here and There|http://www.netflix.com/Movie/Now_and_Then_Here_and_There_Vol._1_Discord_and_Doom/60022187]
 * [Be with Me|http://www.netflix.com/Movie/Be_with_Me/70058486]
 * [I’ll Be There|http://www.netflix.com/Movie/I_ll_Be_There/60026116]
 * [It Had to Be You|http://www.netflix.com/Movie/It_Had_to_Be_You/60024124]
 * [You Should Not Be Here|https://www.netflix.com/Movie/You_Should_Not_Be_Here/70002914]
 * [You Are Here|http://www.netflix.com/Movie/You_Are_Here/60022152]

https://observer.wunderwood.org/2007/05/31/do-all-stopword-queries-matter/
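
To make the failure mode concrete, here is a minimal Python sketch of what a stopword filter does to titles like the ones above. The stopword list and the analyze() helper are assumptions made up for this illustration: the list is not the one Netflix used, and it is not the contents of any Solr stopwords.txt. Real analyzers also tokenize differently, so treat this as a rough model only.

```python
import re

# Hypothetical stopword list, for illustration only. It is NOT the list
# Netflix used in 2007 and NOT the contents of any Solr stopwords.txt.
STOPWORDS = {
    "a", "again", "an", "and", "are", "be", "being", "had", "have",
    "here", "i", "it", "ll", "me", "not", "now", "once", "or",
    "should", "the", "then", "there", "to", "with", "you",
}


def analyze(title):
    """Lowercase, split on anything that is not a-z, then drop stopwords."""
    tokens = re.findall(r"[a-z]+", title.lower())
    return [token for token in tokens if token not in STOPWORDS]


if __name__ == "__main__":
    titles = [
        "Being There",
        "To Be or Not To Be",
        "You Are Here",
        "The Matrix",  # contrast case: one token survives
    ]
    for title in titles:
        tokens = analyze(title)
        status = "EMPTY" if not tokens else "ok"
        print(f"{title!r} -> {tokens} ({status})")
```

With a list like this, every token of "To Be or Not To Be" is dropped, so there is nothing left to index or to query on, while "The Matrix" still keeps one searchable token.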

> Remove Stopwords from sample_techproducts_config?
> -------------------------------------------------
>
>                 Key: SOLR-15779
>                 URL: https://issues.apache.org/jira/browse/SOLR-15779
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: examples
>    Affects Versions: main (9.0)
>            Reporter: David Eric Pugh
>            Priority: Minor
>
> A recent thread on the user mailing list drove home to me that folks use the 
> {{sample_techproducts_config}} as a guide to good practices...
> https://lists.apache.org/thread/xoq5omk6om2z4tx71k1c8v50hkmv1jvp
> I know I've looked at it plenty over the years to figure out things!   
> Thoughts on removing the stopwords filter from the example?   Thoughts on 
> maybe even tweaking the text 
> https://solr.apache.org/guide/8_10/filter-descriptions.html#stop-filter to 
> highlight some of the challenges in using stop words?   or even, gasp, remove 
> it?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org
