Joel Bernstein created SOLR-9240:
------------------------------------
Summary: Add the partitionKeys parameter to the topic() Streaming
Expression
Key: SOLR-9240
URL: https://issues.apache.org/jira/browse/SOLR-9240
Project: Solr
Issue Type: Improvement
Reporter: Joel Bernstein
Currently the topic() function doesn't accept a partitionKeys parameter like
the search() function does. This means the topic() function can't be wrapped by
the parallel() function to run across worker nodes.
It would be useful to support parallelizing the topic function because it would
provide a general purpose parallelized approach for processing batches of data
as they enter the index.
For example this would allow a classify() function to be wrapped around a
topic() function to classify documents in parallel across worker nodes.
Sample syntax:
{code}
parallel(daemon(update(classify(topic(..., partitionKeys="id")))))
{code}
The example above would send a daemon out to worker nodes that would classify
all new documents returned by the topic() function. The update function would
send the output of classify() to a SolrCloud collection for indexing.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]