Hello Fiz,
This normally happens when websites are capable of responding with
translations of their content. Usually this is controlled by the client's
Accept-Lang header, and in worse cases, it is decided based on client
apparent IP address.
In Nutch you can test its output by using the bin/nutc
Hi SOLR Experts,
In Azure VM (Linux), we have installed Solr version 8.11.2 and Nutch
Crawler (apache-nutch-1.19). Crawling the site for Italian Language we
added the tokenizer. *In the Solr admin screen we could see the document
but in English language.*
Please see the below attached managed sc