dev
Messages by Thread
[PR] build(deps): bump actions/cache from 6.0.0 to 6.1.0 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump actions/cache from 6.0.0 to 6.1.0 [stormcrawler]
via GitHub
[PR] build(deps): bump org.apache:apache from 38 to 39 [stormcrawler]
via GitHub
[PR] build(deps): bump langchain4j.version from 1.16.3 to 1.17.0 [stormcrawler]
via GitHub
[PR] build(deps): bump software.amazon.awssdk:bom from 2.46.15 to 2.46.17 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump software.amazon.awssdk:bom from 2.46.15 to 2.46.17 [stormcrawler]
via GitHub
[PR] build(deps): bump junit.version from 6.1.0 to 6.1.1 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump junit.version from 6.1.0 to 6.1.1 [stormcrawler]
via GitHub
[PR] build(deps): bump actions/setup-java from 5.3.0 to 5.4.0 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump actions/setup-java from 5.3.0 to 5.4.0 [stormcrawler]
via GitHub
[PR] build(deps): bump actions/cache from 5.0.5 to 6.0.0 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump actions/cache from 5.0.5 to 6.0.0 [stormcrawler]
via GitHub
[PR] use placeholder instead of older version of archetypes in README [stormcrawler]
via GitHub
Re: [PR] use placeholder instead of older version of archetypes in README [stormcrawler]
via GitHub
Re: [I] JSoup parser to handle text/plain [stormcrawler]
via GitHub
[PR] build(deps): bump software.amazon.awssdk:bom from 2.46.10 to 2.46.15 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump software.amazon.awssdk:bom from 2.46.10 to 2.46.15 [stormcrawler]
via GitHub
[PR] build(deps): bump selenium.version from 4.44.0 to 4.45.0 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump selenium.version from 4.44.0 to 4.45.0 [stormcrawler]
via GitHub
[PR] build(deps): bump langchain4j.version from 1.16.2 to 1.16.3 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump langchain4j.version from 1.16.2 to 1.16.3 [stormcrawler]
via GitHub
[PR] build(deps): bump actions/checkout from 6.0.3 to 7.0.0 [stormcrawler]
via GitHub
Re: [PR] build(deps): bump actions/checkout from 6.0.3 to 7.0.0 [stormcrawler]
via GitHub
[PR] Bump actions/setup-java from 5.2.0 to 5.3.0 [stormcrawler]
via GitHub
Re: [PR] Bump actions/setup-java from 5.2.0 to 5.3.0 [stormcrawler]
via GitHub
[GH] (stormcrawler/fix/1955-filespout-distributed-mode): Workflow run "Java CI with Maven" is working again!
GitBox
[PR] #1955 - Make FileSpout work in distributed mode [stormcrawler]
via GitHub
Re: [PR] #1955 - Make FileSpout work in distributed mode [stormcrawler]
via GitHub
Re: [PR] #1955 - Make FileSpout work in distributed mode [stormcrawler]
via GitHub
Re: [PR] #1955 - Make FileSpout work in distributed mode [stormcrawler]
via GitHub
Re: [I] support retry-after in FetcherBolt [stormcrawler]
via GitHub
Re: [I] Protocol-okhttp: implement IP filter [stormcrawler]
via GitHub
[I] [Bug report] FileSpout does not work in distributed mode [stormcrawler]
via GitHub
Re: [I] [Bug report] FileSpout does not work in distributed mode [stormcrawler]
via GitHub
[GH] (stormcrawler/feature/466): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/FileSpoutCompressedFiles): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/FileSpoutCompressedFiles): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/fix/multiproxy-lookup-map): Workflow run "Java CI with Maven" failed!
GitBox
[GH] (stormcrawler/fix/multiproxy-lookup-map): Workflow run "Java CI with Maven" failed!
GitBox
[GH] (stormcrawler/fix/multiproxy-lookup-map): Workflow run "Java CI with Maven" failed!
GitBox
[GH] (stormcrawler/fix/multiproxy-lookup-map): Workflow run "Java CI with Maven" failed!
GitBox
[PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
Re: [PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
[PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
Re: [PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
Re: [PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
Re: [PR] #1951 refactor: replace linear proxy scan with lookup map in MultiProxyManager [stormcrawler]
via GitHub
[PR] FileSpout: add support for gzip and bz2 files [stormcrawler]
via GitHub
Re: [PR] FileSpout: add support for gzip and bz2 files [stormcrawler]
via GitHub
Re: [PR] FileSpout: add support for gzip and bz2 files [stormcrawler]
via GitHub
Re: [PR] FileSpout: add support for gzip and bz2 files [stormcrawler]
via GitHub
[I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
Re: [I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
Re: [I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
Re: [I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
Re: [I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
Re: [I] [Improvement] Replace linear proxy scan in MultiProxyManager with a lookup map [stormcrawler]
via GitHub
[PR] Bump org.opensearch.client:opensearch-java from 3.8.0 to 3.9.0 [stormcrawler]
via GitHub
Re: [PR] Bump org.opensearch.client:opensearch-java from 3.8.0 to 3.9.0 [stormcrawler]
via GitHub
[PR] Bump software.amazon.awssdk:bom from 2.46.7 to 2.46.10 [stormcrawler]
via GitHub
Re: [PR] Bump software.amazon.awssdk:bom from 2.46.7 to 2.46.10 [stormcrawler]
via GitHub
[PR] Bump okhttp.version from 5.3.2 to 5.4.0 [stormcrawler]
via GitHub
Re: [PR] Bump okhttp.version from 5.3.2 to 5.4.0 [stormcrawler]
via GitHub
[PR] Bump langchain4j.version from 1.16.1 to 1.16.2 [stormcrawler]
via GitHub
Re: [PR] Bump langchain4j.version from 1.16.1 to 1.16.2 [stormcrawler]
via GitHub
[GH] (stormcrawler/784-retry-after-fetcherbolt): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/784-retry-after-fetcherbolt): Workflow run "Java CI with Maven" failed!
GitBox
[PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
Re: [PR] #784 - Support Retry-After in FetcherBolt [stormcrawler]
via GitHub
[GH] (stormcrawler/feature/466): Workflow run "Java CI with Maven" failed!
GitBox
[PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
Re: [PR] #466 - Handle text/plain content in JSoupParserBolt [stormcrawler]
via GitHub
[GH] (stormcrawler/feature/1107-okhttp-ip-filter): Workflow run "Java CI with Maven" failed!
GitBox
[GH] (stormcrawler/feature/1107-okhttp-ip-filter): Workflow run "Java CI with Maven" failed!
GitBox
[PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
Re: [PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
Re: [PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
Re: [PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
Re: [PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
Re: [PR] #1107 - Add okhttp IP address filter [stormcrawler]
via GitHub
[PR] docs: fix StormCrawler documentation links [stormcrawler]
via GitHub
Re: [PR] docs: fix StormCrawler documentation links [stormcrawler]
via GitHub
Re: [PR] docs: fix StormCrawler documentation links [stormcrawler]
via GitHub
[PR] Migrate AWS Java SDK from 1.x to 2.x (#1938) [stormcrawler]
via GitHub
Re: [PR] #1938 - Migrate AWS Java SDK from 1.x to 2.x (#1938) [stormcrawler]
via GitHub
Re: [PR] #1938 - Migrate AWS Java SDK from 1.x to 2.x (#1938) [stormcrawler]
via GitHub
Re: [PR] #1938 - Migrate AWS Java SDK from 1.x to 2.x (#1938) [stormcrawler]
via GitHub
Re: [PR] #1938 - Migrate AWS Java SDK from 1.x to 2.x (#1938) [stormcrawler]
via GitHub
[I] [Improvement] Migrate AWS Java SDK from 1.x to 2.x [stormcrawler]
via GitHub
Re: [I] [Improvement] Migrate AWS Java SDK from 1.x to 2.x [stormcrawler]
via GitHub
[PR] Show bare StormCrawler™ name next to download link for trademark registration [stormcrawler-site]
via GitHub
Re: [PR] Show bare StormCrawler™ name next to download link for trademark registration [stormcrawler-site]
via GitHub
[PR] #1923 docs: fix dead links [stormcrawler]
via GitHub
Re: [PR] #1923 docs: fix dead links [stormcrawler]
via GitHub
Re: [PR] #1923 docs: fix dead links [stormcrawler]
via GitHub
Re: [PR] #1923 docs: fix dead links [stormcrawler]
via GitHub
[PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
Re: [PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
Re: [PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
Re: [PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
Re: [PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
Re: [PR] 1143 Add per-URL proxy metadata selection [stormcrawler]
via GitHub
[PR] Bump org.jacoco:jacoco-maven-plugin from 0.8.14 to 0.8.15 [stormcrawler]
via GitHub
Re: [PR] Bump org.jacoco:jacoco-maven-plugin from 0.8.14 to 0.8.15 [stormcrawler]
via GitHub
[PR] Bump langchain4j.version from 1.15.1 to 1.16.1 [stormcrawler]
via GitHub
Re: [PR] Bump langchain4j.version from 1.15.1 to 1.16.1 [stormcrawler]
via GitHub
[PR] Add AGENTS.md + SECURITY.md pointing to the security model (scanner discoverability) [stormcrawler]
via GitHub
Re: [PR] Add AGENTS.md + SECURITY.md pointing to the security model (scanner discoverability) [stormcrawler]
via GitHub
[PR] Improve fault tolerance: handle parse errors, fix logging, and minor cleanup [stormcrawler]
via GitHub
Re: [PR] Improve fault tolerance: handle parse errors, fix logging, and minor cleanup [stormcrawler]
via GitHub
Re: [PR] Improve fault tolerance: handle parse errors, fix logging, and minor cleanup [stormcrawler]
via GitHub
[PR] Fix resource leaks: use try-with-resources and add cleanup hooks [stormcrawler]
via GitHub
Re: [PR] Fix resource leaks: use try-with-resources and add cleanup hooks [stormcrawler]
via GitHub
[PR] Fix thread safety: replace shared Matcher and SimpleDateFormat instances [stormcrawler]
via GitHub
Re: [PR] Fix thread safety: replace shared Matcher and SimpleDateFormat instances [stormcrawler]
via GitHub
Re: [PR] Fix thread safety: replace shared Matcher and SimpleDateFormat instances [stormcrawler]
via GitHub
[PR] Fix thread safety, resource leaks, and error handling across core and external modules [stormcrawler]
via GitHub
Re: [PR] Fix thread safety, resource leaks, and error handling across core and external modules [stormcrawler]
via GitHub
Re: [PR] Fix thread safety, resource leaks, and error handling across core and external modules [stormcrawler]
via GitHub
Re: [PR] Fix thread safety, resource leaks, and error handling across core and external modules [stormcrawler]
via GitHub
[PR] Bump actions/checkout from 6.0.2 to 6.0.3 [stormcrawler]
via GitHub
Re: [PR] Bump actions/checkout from 6.0.2 to 6.0.3 [stormcrawler]
via GitHub
[PR] Fix missing space between block and inline sibling elements in JSoupTextExtractor [stormcrawler]
via GitHub
Re: [PR] Fix missing space between block and inline sibling elements in JSoupTextExtractor [stormcrawler]
via GitHub
[I] JSoupTextExtractor: missing space between block and inline sibling elements [stormcrawler]
via GitHub
Re: [I] JSoupTextExtractor: missing space between block and inline sibling elements [stormcrawler]
via GitHub
[DISCUSS] Java 25 baseline when moving to Storm 3
Richard Zowalla
Re: [DISCUSS] Java 25 baseline when moving to Storm 3
Julien Nioche
Re: [DISCUSS] Java 25 baseline when moving to Storm 3
Davide Polato
[I] Fix dead links [stormcrawler]
via GitHub
Re: [I] Fix dead links [stormcrawler]
via GitHub
Re: [I] Fix dead links [stormcrawler]
via GitHub
Re: [I] Fix dead links [stormcrawler]
via GitHub
[I] Fix dead links [stormcrawler-site]
via GitHub
Re: [I] Fix dead links [stormcrawler-site]
via GitHub
Re: [I] Fix dead links [stormcrawler-site]
via GitHub
Re: [I] Fix dead links [stormcrawler-site]
via GitHub
[PR] Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.5 to 3.5.6 [stormcrawler]
via GitHub
Re: [PR] Bump org.apache.maven.plugins:maven-surefire-plugin from 3.5.5 to 3.5.6 [stormcrawler]
via GitHub
[PR] Bump langchain4j.version from 1.15.0 to 1.15.1 [stormcrawler]
via GitHub
Re: [PR] Bump langchain4j.version from 1.15.0 to 1.15.1 [stormcrawler]
via GitHub
[PR] Bump tika.version from 3.3.0 to 3.3.1 [stormcrawler]
via GitHub
Re: [PR] Bump tika.version from 3.3.0 to 3.3.1 [stormcrawler]
via GitHub
[PR] StormCrawler 3.6.0 [stormcrawler]
via GitHub
Re: [PR] StormCrawler 3.6.0 [stormcrawler]
via GitHub
[RESULT] [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Richard Zowalla
[VOTE] Apache StormCrawler 3.6.0 Release Candidate
Richard Zowalla
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Davide Polato
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Dávid Szigecsán
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Julien Nioche
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Markos Volikas
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Sebastian Nagel
Re: [VOTE] Apache StormCrawler 3.6.0 Release Candidate
Richard Zowalla
[PR] Bump junit.version from 6.0.3 to 6.1.0 [stormcrawler]
via GitHub
Re: [PR] Bump junit.version from 6.0.3 to 6.1.0 [stormcrawler]
via GitHub
[PR] Bump org.apache.maven.plugins:maven-enforcer-plugin from 3.6.2 to 3.6.3 [stormcrawler]
via GitHub
Re: [PR] Bump org.apache.maven.plugins:maven-enforcer-plugin from 3.6.2 to 3.6.3 [stormcrawler]
via GitHub
[PR] Bump com.microsoft.playwright:playwright from 1.59.0 to 1.60.0 [stormcrawler]
via GitHub
Re: [PR] Bump com.microsoft.playwright:playwright from 1.59.0 to 1.60.0 [stormcrawler]
via GitHub
[PR] Upgrade to Storm 2.8.8 [stormcrawler]
via GitHub
Re: [PR] Upgrade to Storm 2.8.8 [stormcrawler]
via GitHub
Concerns about Storm's sustainability and a possible path forward
Richard Zowalla
Re: Concerns about Storm's sustainability and a possible path forward
Dávid Szigecsán
Re: Concerns about Storm's sustainability and a possible path forward
Davide Polato
Re: Concerns about Storm's sustainability and a possible path forward
Julien Nioche
[PR] Bump org.slf4j:slf4j-simple from 2.0.17 to 2.0.18 [stormcrawler]
via GitHub
Re: [PR] Bump org.slf4j:slf4j-simple from 2.0.17 to 2.0.18 [stormcrawler]
via GitHub
[PR] Bump langchain4j.version from 1.14.1 to 1.15.0 [stormcrawler]
via GitHub
Re: [PR] Bump langchain4j.version from 1.14.1 to 1.15.0 [stormcrawler]
via GitHub
[PR] Bump selenium.version from 4.43.0 to 4.44.0 [stormcrawler]
via GitHub
Re: [PR] Bump selenium.version from 4.43.0 to 4.44.0 [stormcrawler]
via GitHub
[GH] (stormcrawler/increase-checkstyle-severities): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/increase-checkstyle-severities): Workflow run "Java CI with Maven" failed!
GitBox
[PR] Increase checkstyle severities [stormcrawler]
via GitHub
Re: [PR] Increase checkstyle severities [stormcrawler]
via GitHub
[PR] [INFRA] Set up default rulesets for default and release branches [stormcrawler-site]
via GitHub
Re: [PR] [INFRA] Set up default rulesets for default and release branches [stormcrawler-site]
via GitHub
[GH] (stormcrawler/playwrightTests): Workflow run "Java CI with Maven" is working again!
GitBox
[GH] (stormcrawler/playwrightTests): Workflow run "Java CI with Maven" failed!
GitBox
[GH] (stormcrawler/playwrightTests): Workflow run "Java CI with Maven" failed!
GitBox
[PR] Fix Playwright tests failing when run together [stormcrawler]
via GitHub
Re: [PR] Fix Playwright tests failing when run together [stormcrawler]
via GitHub
Re: [PR] Fix Playwright tests failing when run together [stormcrawler]
via GitHub
Re: [PR] Fix Playwright tests failing when run together [stormcrawler]
via GitHub
Re: [PR] Fix Playwright tests failing when run together [stormcrawler]
via GitHub
[ANN] Welcome new Apache StormCrawler Committer Davide Polato
Richard Zowalla