[
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hoss Man updated SOLR-2894:
---------------------------
Attachment: SOLR-2894_cloud_test.patch
bq. I'm not able to reproduce this. Could you tell me a little more about your
setup?
trunk, with patch applied, build the example and then run the [Simple Two-Shard
Cluster|https://cwiki.apache.org/confluence/display/solr/Getting+Started+with+SolrCloud#GettingStartedwithSolrCloud-SimpleTwo-ShardClusterontheSameMachine]
...
{noformat}
hossman@frisbee:~/lucene/dev/solr$ cp -r example node1
hossman@frisbee:~/lucene/dev/solr$ cp -r example node2
# in term1...
hossman@frisbee:~/lucene/dev/solr/node1$ java -DzkRun -DnumShards=2
-Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf -jar
start.jar
# wait for node1 startup, then in term2...
hossman@frisbee:~/lucene/dev/solr/node2$ java -Djetty.port=7574
-DzkHost=localhost:9983 -jar start.jar
# wait for node2 startup, then in term3...
hossman@frisbee:~/lucene/dev/solr/example/exampledocs$ java -jar post.jar *.xml
SimplePostTool version 1.5
Posting files to base url http://localhost:8983/solr/update using content-type
application/xml..
...
14 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/update..
Time spent: 0:00:01.763
hossman@frisbee:~/lucene/dev/solr/example/exampledocs$ curl
'http://localhost:8983/solr/select?q=*:*&sort=id+desc&rows=2&facet=true&facet.pivot=cat,manu_+id_s,inStock&facet.limit=3'
> /dev/null
# watch the logs in term1 and term2 go spinning like mad
{noformat}
----
bq. While the size of the shard parameters may not strictly be as efficient as
possible, is it such that we can run with that for now and circle back to this
at a later point, or are you uncomfortable with including the parameters as is
in the initial commit?
Hmm... not sure how i feel about it w/o more testing - from what i was seeing,
with non-trivial field names, term values, and facet.limit the refinements
requests were getting *HUGE* so I suspect it's something we're going to want to
tackle before releasing -- but refactoring it to be smaller definitely seems
like something that should be a lower priority to some of the correctness
related issues we're finding, and adding more tests (so we can be confident the
refactoring is correct)
----
I'm attaching a "SOLR-2894_cloud_test.patch" that contains a new cloud based
randomized test i've been working at off and on over the last few days (I
created it as a standalone patch because i didn't want to conflict with
anything Brett might be in the middle of, and it was easy to do - kept me
focused on the test and not dabbling with the internals).
The test builds up a bunch of random docs, then does a handfull of random pivot
facet queries. For each pivot query, it recursively walks the pivot response
executing verification queries using "fq" params it builds up from the pivot
constraints -- so if pivot.facet=a,b,c says that "a" has a term "x" with 4
matching docs, it adds an "fq=a:x" to the original query and checks the count;
then it looks a the pivot terms for field "b" under "a:x" and also executes a
query for each of them with another fq added, etc...
As is, the patch currently passes, but that's only because of a few nocommits...
* randomization of mincount is disabled due to the refinement bug i mentioned
before
* it's currently only doing pivots on 2 string fields (one multivalued and one
single valued) ... any attempts at pivot faceting the numeric/date/boolean
fields (already included in the docs) causes an NPE in the SolrJ QueryResponse
class (i haven't investigated why yet)
> Implement distributed pivot faceting
> ------------------------------------
>
> Key: SOLR-2894
> URL: https://issues.apache.org/jira/browse/SOLR-2894
> Project: Solr
> Issue Type: Improvement
> Reporter: Erik Hatcher
> Fix For: 4.9, 5.0
>
> Attachments: SOLR-2894-reworked.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch,
> SOLR-2894_cloud_test.patch, dateToObject.patch, pivot_mincount_problem.sh
>
>
> Following up on SOLR-792, pivot faceting currently only supports
> undistributed mode. Distributed pivot faceting needs to be implemented.
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]