Re: Crawling Italian language site in Solr

2023-07-28 Thread Markus Jelsma
Hello Fiz, This normally happens when a website can respond with translations of its content. Usually this is controlled by the client's Accept-Language header and, in worse cases, it is decided based on the client's apparent IP address. In Nutch you can test its output by using the bin/nutc
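
To illustrate the header-based negotiation described above, here is a minimal Python sketch, independent of Nutch, that requests a page with a given Accept-Language value and reports the Content-Language the server returns. The helper names and the example URL are hypothetical, and a server may omit Content-Language or ignore the header entirely (for instance when it decides by IP address), so treat this as a quick check rather than a definitive test.

```python
from typing import Optional
from urllib.request import Request, urlopen


def build_request(url: str, accept_language: str) -> Request:
    """Build a GET request that advertises the preferred language(s).

    `accept_language` uses the usual header syntax, e.g. "it-IT,it;q=0.9".
    """
    return Request(url, headers={"Accept-Language": accept_language})


def fetch_content_language(url: str, accept_language: str,
                           timeout: float = 10.0) -> Optional[str]:
    """Fetch `url` and return the Content-Language response header, if any.

    Returns None when the server does not send the header.
    """
    with urlopen(build_request(url, accept_language), timeout=timeout) as resp:
        return resp.headers.get("Content-Language")


if __name__ == "__main__":
    # Hypothetical URL; replace with the site being crawled.
    site = "https://example.com/"
    for lang in ("it-IT,it;q=0.9", "en-US,en;q=0.9"):
        print(lang, "->", fetch_content_language(site, lang))
```

If the two requests come back with different languages, the site negotiates on Accept-Language, and the crawler's own request header is the thing to check.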

Crawling Italian language site in Solr

2023-07-27 Thread Fiz N
Hi SOLR Experts, In an Azure VM (Linux), we have installed Solr version 8.11.2 and Nutch Crawler (apache-nutch-1.19). To crawl the site in Italian, we added the tokenizer. *In the Solr admin screen we could see the document, but in English.* Please see the below attached managed sc