Any input/suggestions on parallelizing below operations using Spark over
Java Thread pooling
- reading of 100 thousands json files from local file system
- processing each file content and submitting to Solr as Input document
Thanks,
Susheel
On Mon, Nov 16, 2015 at 5:44 PM, Susheel Kumar
wrote
Hello Spark Users,
My first email to spark mailing list and looking forward. I have been
working on Solr and in the past have used Java thread pooling to
parallelize Solr indexing using SolrJ.
Now i am again working on indexing data and this time from JSON files (in
100 thousands) and before I tr