preko-p opened a new pull request, #1935: URL: https://github.com/apache/stormcrawler/pull/1935
Fixes #1143. ## Summary - Add per-URL proxy selection from URL metadata for the built-in proxy managers. - Support `http.proxy.skip=true`, full `http.proxy` connection strings, and component metadata keys (`http.proxy.host`, `http.proxy.port`, `http.proxy.type`, `http.proxy.user`, `http.proxy.pass`). - Validate invalid metadata proxy values instead of falling back to configured proxies. - Keep configured proxy defaults and rotation behavior unchanged when metadata is absent. - Document metadata proxy precedence and behavior. ## Verification - `git diff --check` - Docker: `mvn -B -ntp -Dskip.format.code=false -pl core git-code-format:format-code -DskipTests` - Docker: `mvn -B -ntp -pl core -Dtest=SingleProxyManagerTest,MultiProxyManagerTest,HttpProtocolProxyConcurrencyTest,HttpClientProtocolProxyManagerTest,FetcherBoltTest,SimpleFetcherBoltTest -DfailIfNoTests=false test` Focused selector result: `Tests run: 44, Failures: 0, Errors: 0, Skipped: 0`, `BUILD SUCCESS`. Note: I also attempted Docker `mvn clean verify`. The `core` module passed 264 tests and the reactor advanced past `core`, but the full reactor later failed in `stormcrawler-opensearch` on JaCoCo threshold failures unrelated to this patch, so I am not claiming a full-repo pass. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
