Hi Artem,

The main problem is that your schema is using the
DefaultAnalyzerFactory, which (roughly speaking) tokenizes based on
punctuation, discards any tokens less than three characters in length, and
converts the tokens to lowercase. I'd recommend that you use the
WhitespaceAnalyzerFactory, which (as the name suggests) tokenizes based on
whitespace. (You can do this by setting the 'analyzer_factory' schema
setting to "com.basho.search.analysis.WhitespaceAnalyzerFactory"). This will
cause your queries to act in the way you expect.
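
As a rough illustration of the difference, here is a small Python sketch. It
is not the actual Riak Search analyzer code: the function names are
hypothetical, and the "default-like" tokenizer only approximates the behavior
described above (split on punctuation, drop tokens shorter than three
characters, lowercase). Any other processing the real analyzer may do is
ignored.

```python
import re

def default_like_tokens(text, min_len=3):
    """Approximation (assumption) of the default analyzer described above."""
    # Split on any run of characters that are not ASCII letters or digits.
    raw = re.split(r"[^0-9A-Za-z]+", text)
    # Lowercase and discard tokens shorter than min_len (including empty ones).
    return [t.lower() for t in raw if len(t) >= min_len]

def whitespace_tokens(text):
    """Approximation (assumption) of a whitespace analyzer: split on whitespace only."""
    return text.split()

url = "http://www.toolboxrecords.com/fr/product/12834/dubstep-grime-ukg/ruff-11/"

print(default_like_tokens(url))
# ['http', 'www', 'toolboxrecords', 'com', 'product', '12834',
#  'dubstep', 'grime', 'ukg', 'ruff']

print(whitespace_tokens(url) == [url])
# True: the whole URL stays a single token

print(default_like_tokens("UK"), default_like_tokens("1"))
# [] []
```

Under these assumptions the full URL never exists as a single indexed term
with the default analyzer, while the whitespace analyzer keeps it whole. The
empty results for "UK" and "1" would also be consistent with those two
queries returning nothing, since both values are shorter than three
characters.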

Once the next release of Riak Search is out, I'd recommend that you set
field-level analyzers according to the values you expect to put in the
different fields. So the "title" and "summary" fields should use the Default
analyzer, "storylink" and "fieldlink" should use the Whitespace analyzer,
and any integer fields should use the Integer analyzer. I say to do that
after installing the *next* release because there is a bug in the
*current* version of Riak Search whereby field-level analyzer settings are
not respected. This is fixed in the next release.

One last thing to mention: wildcards are only supported at the end of a
word. (So your query for "*www.toolboxrecords.com*" isn't working the way
you expect, and is probably returning results because it is getting split
into three separate queries for "www", "toolboxrecords", and "com".)
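
The splitting described above can be sketched in Python. This is an
approximation under stated assumptions, not Riak Search's query parser: both
function names are hypothetical, and the tokenizer simply mimics "split on
punctuation, drop short tokens, lowercase".

```python
import re

def default_like_tokens(text, min_len=3):
    """Approximation (assumption) of the default analyzer's tokenization."""
    raw = re.split(r"[^0-9A-Za-z]+", text)
    return [t.lower() for t in raw if len(t) >= min_len]

def has_only_trailing_wildcard(term):
    """True when the single '*' in term is its last character, e.g. 'toolbox*'."""
    return len(term) > 1 and term.endswith("*") and "*" not in term[:-1]

query = "*www.toolboxrecords.com*"

print(has_only_trailing_wildcard(query))       # False
print(has_only_trailing_wildcard("toolbox*"))  # True
print(default_like_tokens(query))              # ['www', 'toolboxrecords', 'com']
```

In this sketch the leading and trailing "*" are treated as punctuation like
any other, which leaves the three plain terms mentioned above.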

Best,
Rusty

On Sun, Dec 5, 2010 at 5:32 AM, Artem Golovinsky
<artemgolovin...@gmail.com> wrote:

> Hi,
> I have some questions about Riak Search.
>
> I stored just one entry to RS:  http://pastebin.com/2b5tybD1
>
> My bucket has following schema: http://pastebin.com/dtnM3diL
>
> 1. I need to get all objects where "storylink":
> "http://www.toolboxrecords.com/fr/product/12834/dubstep-grime-ukg/ruff-11/"
> When I try to get it by query -
> http://localhost:8098/solr/myfeed13/select?wt=json&q=storylink:"http://www.toolboxrecords.com/fr/product/12834/dubstep-grime-ukg/ruff-11/"
> , I get nothing.
> When I try to make query
> http://localhost:8098/solr/myfeed13/select?wt=json&q=storylink:"*www.toolboxrecords.com*"
> , all works correctly and I get content.
> Could you please describe how can I get content through first query?
>
> 2. I get my object by query
> http://localhost:8098/solr/myfeed13/select?wt=json&q=knownentities_type:"Country"
> or
> http://localhost:8098/solr/myfeed13/select?wt=json&q=knownentities_relevance:"0.914286"
> But I can't get they by
> http://localhost:8098/solr/myfeed13/select?wt=json&q=knownentities_count:
> "1"
> or http://localhost:8098/solr/myfeed13/select?wt=json&q=knownentities_text:
> "UK"
>
> Why does it happen? However, all these fields are described similar in
> schema.
>
> 3. I try to get my entry  -
> http://localhost:8098/solr/myfeed13/select?wt=json&q=categoryscores_business:"0.*"
> But result is nothing.
>
> As you can see "business" is nested field for "categoryscores" field.
> How can I get this entry?
>
>
> --
> Best regards,
> Artem Golovinsky
>
> _______________________________________________
> riak-users mailing list
> riak-users@lists.basho.com
> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com
>
>
