Hello Tim,

Thanks for the interesting suggestion :)
I'll take a look at the issue and give it a try!
I'll reach out if I have any questions.

Best Regards,
Yongjun

> Hi Yongjun,
>
> I'm sorry for my delay. One resource that would be handy to have (if it
> doesn't exist) would be documentation on what parameters can be set per
> domain in the `seeds.txt` file. I've found a few by looking through the
> source code, and this might be a useful exercise for you. The output would
> be useful to all (including me :D)!
>
> An example would be `max.depth` as added here:
> https://github.com/apache/incubator-stormcrawler/issues/399
>
> On Sun, Jan 26, 2025 at 7:14 AM Yongjun Hong <dev.yongj...@gmail.com>
> wrote:
>
> > Thank you for your attention.
> >
> > The email sent from kevin...@ was actually a mistake, as I confused the
> > accounts.
> > I will use the dev...@ email going forward.
> >
> > Have a great day!
> >
> > > Hi,
> > >
> > > Yep. I just verified it.
> > >
> > > I think the issue is that mails from your kevin…@ address are put
> > > into moderation.
> > > So if you are using dev….@ it should just work fine.
> > >
> > > Regards,
> > > Richard
> > >
> > > > On 26.01.2025 at 10:14, 홍용준 <dev.yongj...@gmail.com> wrote:
> > > >
> > > > It seems that I am already subscribed to the mailing list.
> > > > I reached out to dev-subscr...@stormcrawler.apache.org and received a
> > > > response indicating that my email address is already registered.
> > > >
> > > > This is my first time participating in the Apache mailing list and
> > > > writing messages, so I am still very new to this process.
> > > > If I am doing something wrong or missing anything important, I would
> > > > greatly appreciate it if you could let me know.🥹
> > > >
> > > > On Sun, Jan 26, 2025 at 5:54 PM, Richard Zowalla <r...@apache.org>
> > > > wrote:
> > > >
> > > >> Hi,
> > > >>
> > > >> Maybe it would also be good to subscribe to the dev@ list? :)
> > > >> Your mails are going into moderation, and that would be a way to
> > > >> circumvent that.
> > > >>
> > > >> Regards,
> > > >> Richard
> > > >>
> > > >>> On 26.01.2025 at 05:52, 홍용준 <kevin09288...@gmail.com> wrote:
> > > >>>
> > > >>> Here is the naturally translated email:
> > > >>> ------------------------------
> > > >>>
> > > >>> Hi Julien,
> > > >>>
> > > >>> I apologize for the delayed response.
> > > >>> I’m still getting used to the Apache Mailing list, so I didn’t
> > > >>> realize I had received a reply. 😅
> > > >>>
> > > >>> As suggested, I will start by running StormCrawler.
> > > >>> Once I’ve built up some basic knowledge about the project, I will
> > > >>> review the discussion you shared and follow up with my thoughts.
> > > >>>
> > > >>> Thank you!
> > > >>>
> > > >>> Best regards,
> > > >>> Yongjun Hong
> > > >>>
> > > >>> On 2025/01/22 20:00:51 Julien Nioche wrote:
> > > >>>> Hi Yongjun Hong,
> > > >>>>
> > > >>>> Thanks for your email and your interest in contributing to
> > > >>>> StormCrawler.
> > > >>>> The project is quite mature and there isn't anything that is totally
> > > >>>> missing from it but there definitely are improvements to be made.
> > > >>>> Contributions take many forms and not just code: a good starting
> > > >>>> point would be to run StormCrawler, look at the existing resources
> > > >>>> and documentation and see if anything is not clear or missing.
> > > >>>> Reporting bugs or asking questions is also very valuable.
> > > >>>>
> > > >>>> One thing I would like to see however is a new spout implementation
> > > >>>> for OpenSearch; I had written a description for Elasticsearch a
> > > >>>> while ago
> > > >>>> <https://github.com/apache/incubator-stormcrawler/discussions/990>
> > > >>>> which is valid for OpenSearch and could improve the performance
> > > >>>> significantly.
> > > >>>> Probably a bit of an advanced topic though, maybe as a start just
> > > >>>> run StormCrawler, delve into the details and see what you think?
> > > >>>>
> > > >>>> Thanks!
> > > >>>>
> > > >>>> Julien
> > > >>>>
> > > >>>>
> > > >>>> On Tue, 21 Jan 2025 at 23:57, 홍용준 <de...@gmail.com> wrote:
> > > >>>>
> > > >>>>> Dear StormCrawler Team,
> > > >>>>>
> > > >>>>> My name is Yongjun Hong, and I am a software developer with a
> > > >>>>> strong interest in contributing to open-source projects. I have
> > > >>>>> had the privilege of contributing to JUnit 5 by implementing
> > > >>>>> performance improvements and adding new features, which allowed me
> > > >>>>> to gain a deeper appreciation for collaborative development within
> > > >>>>> the open-source community.
> > > >>>>>
> > > >>>>> Beyond JUnit, I have also made frequent contributions to other
> > > >>>>> Java-based open-source libraries, including Spring Boot. My work
> > > >>>>> has focused on enhancing core functionality and resolving
> > > >>>>> community-reported issues, aiming to improve both usability and
> > > >>>>> reliability.
> > > >>>>>
> > > >>>>> I am particularly drawn to StormCrawler because of its robust and
> > > >>>>> scalable web crawling capabilities, and I am eager to contribute
> > > >>>>> consistently❕ To better understand how I can align my efforts with
> > > >>>>> the project’s goals, I would like to inquire about its current
> > > >>>>> status, overall roadmap, and any key milestones in the near future.
> > > >>>>>
> > > >>>>> If there are areas where the project could use additional support,
> > > >>>>> or if there are specific tasks or issues you think I could assist
> > > >>>>> with, I would greatly appreciate your guidance.
> > > >>>>>
> > > >>>>> Thank you for your time and for the incredible work you have done
> > > >>>>> on StormCrawler. I am excited about the possibility of contributing
> > > >>>>> and look forward to hearing from you.
> > > >>>>>
> > > >>>>> Here is my GitHub profile: https://github.com/YongGoose
> > > >>>>>
> > > >>>>> Best regards
> > > >>>>>
> > > >>>>> Yongjun Hong