Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-10 Thread via GitHub
jnioche merged PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-09 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2466231094 Hi there! @jnioche I think I have made the changes; also pushed some comments and minor fixes to the readme files. I ran some more tests with a greater numbe

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-04 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2456389209 @mvolikas, latest comments * need to bring in change from #1390 ` storm jar target/crawler-1.0-SNAPSHOT.jar org.apache.storm.flux.Flux injection.flux --l

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453478494 > > > We need one for the injection @mvolikas > > > > > > Ok, so I guess I will add this back. > > Sorry if I wasn't clear - we need a Flux for the inject

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-245341 An update from my side: - I have now tested in local mode with 1 and 4 shards. - I have updated the `SolrSpout` code so that the query param for shards is not ad

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453474014 > > We need one for the injection @mvolikas > > Ok, so I guess I will add this back. Sorry if I wasn't clear - we need a Flux for the injection, not the Java

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453471066 > We need one for the injection @mvolikas Ok, so I guess I will add this back. -- This is an automated message from the Apache Git Service. To respond to the mess

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1827008942 ## external/solr/archetype/src/main/resources/archetype-resources/src/main/java/CrawlTopology.java: ## Review Comment: Thanks, it's clear now. I h

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
rzo1 commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453450532 Bascially, the Java topologies are only good for testing in local mode (IMHO) and are actually only usable, if the IDE is configured to include the `provided` scoped dependen

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453448149 > The actually issue is, that the template misses an import for `import org.apache.stormcrawler.ConfigurableTopology;` -> #1389 The fact that it has been broken for

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453448482 > Yes I have not yet tested with the java topologies The compilation fails whether you use the Java topologies or not... -- This is an automated message from t

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
rzo1 commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1826993141 ## external/solr/archetype/src/main/resources/archetype-resources/src/main/java/CrawlTopology.java: ## Review Comment: It also requires the missing im

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
rzo1 commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453444795 > > ``` > [ERROR] COMPILATION ERROR : > [INFO] - > [ERROR] /home/markos/apache/test/src/main/java/tes

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
rzo1 commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1826991828 ## external/solr/archetype/src/main/resources/archetype-resources/src/main/java/CrawlTopology.java: ## Review Comment: This class file needs to be ren

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-03 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453435316 > @mvolikas compiling the project generated from the archetype fails with > > ``` > [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-02 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453145042 @mvolikas The Java based topologies could actually go. We don't have them in the OpenSearch module and I think the huge majority of people just rely on the Flux files.

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-02 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453064634 @mvolikas compiling the project generated from the archetype fails with ``` [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.11.0:

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-02 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2453063805 archetype generated successfully, no idea why it had failed -- This is an automated message from the Apache Git Service. To respond to the message, please log on

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-02 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1826565193 ## external/solr/src/test/java/org/apache/stormcrawler/solr/persistence/SpoutTest.java: ## @@ -0,0 +1,169 @@ +/* + * Licensed to the Apache Software Fo

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-11-02 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2452974989 > Running the archetype generation with > > mvn archetype:generate -DarchetypeGroupId=org.apache.stormcrawler -DarchetypeArtifactId=stormcrawler-solr-archetype -Da

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-27 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1818169400 ## external/solr/README.md: ## @@ -1,117 +1,30 @@ -stormcrawler-solr -== +# stormcrawler-solr -Set of Solr resources for StormCrawler

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-27 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1818167365 ## external/solr/archetype/src/main/resources/archetype-resources/README.md: ## @@ -0,0 +1,144 @@ +This has been generated by the StormCrawler Maven Arc

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-25 Thread via GitHub
jnioche commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2437747972 > > @mvolikas Do you aim to include this PR in **3.1.1** - if so, do you think you can work on the open comments or would it be ok to move it to **3.1.2**? > > From

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-25 Thread via GitHub
rzo1 commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2434597624 @mvolikas Do you aim to include this PR in **3.1.1** - if so, do you think you can work on the open comments or would it be ok to move it to **3.1.2**? -- This is an automa

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-25 Thread via GitHub
mvolikas commented on PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#issuecomment-2435491323 > @mvolikas Do you aim to include this PR in **3.1.1** - if so, do you think you can work on the open comments or would it be ok to move it to **3.1.2**? From my si

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-18 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1806320925 ## external/solr/setup-solr.sh: ## Review Comment: In our team, we currently have an ES deployment for StormCrawler. The main reason for not usin

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-15 Thread via GitHub
rzo1 commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1800926410 ## external/solr/setup-solr.sh: ## Review Comment: Guess it would be good to have an archetype. We can still remove it, if it gets dormant. At least

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1800472853 ## external/solr/src/test/java/org/apache/stormcrawler/solr/persistence/SpoutTest.java: ## @@ -0,0 +1,169 @@ +/* + * Licensed to the Apache Software Fou

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1800469984 ## external/solr/src/main/java/org/apache/stormcrawler/solr/persistence/SolrSpout.java: ## @@ -156,10 +176,15 @@ else if (resetFetchDateAfterNSecs != -1

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1800465804 ## external/solr/src/main/java/org/apache/stormcrawler/solr/persistence/SolrSpout.java: ## @@ -135,7 +157,8 @@ else if (resetFetchDateAfterNSecs != -1)

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
jnioche commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1800461361 ## external/solr/setup-solr.sh: ## Review Comment: or in a sub directory? I am not against having an archetype for SOLR but so far there has bee

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1799842927 ## external/solr/src/test/java/org/apache/stormcrawler/solr/persistence/SpoutTest.java: ## @@ -0,0 +1,169 @@ +/* + * Licensed to the Apache Software Fo

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1799838080 ## external/solr/src/main/java/org/apache/stormcrawler/solr/persistence/SolrSpout.java: ## @@ -156,10 +176,15 @@ else if (resetFetchDateAfterNSecs != -

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-14 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1799832189 ## external/solr/src/main/java/org/apache/stormcrawler/solr/persistence/SolrSpout.java: ## @@ -135,7 +157,8 @@ else if (resetFetchDateAfterNSecs != -1)

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-13 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1798311949 ## external/solr/src/main/java/org/apache/stormcrawler/solr/persistence/SolrSpout.java: ## @@ -135,7 +157,8 @@ else if (resetFetchDateAfterNSecs != -1)

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-13 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1798303419 ## external/solr/setup-solr.sh: ## Review Comment: Hmm, you are right. But the alternative would be to have the user copy the scripts and the `so

Re: [PR] #620 Add support for shards - SolrSpout [incubator-stormcrawler]

2024-10-13 Thread via GitHub
mvolikas commented on code in PR #1343: URL: https://github.com/apache/incubator-stormcrawler/pull/1343#discussion_r1798272208 ## external/solr/setup-solr.sh: ## @@ -0,0 +1,59 @@ +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreem