OK I should have realized I need to add a request handler for extract, I
guess ping just hits the base url. NM

On Tue, Mar 4, 2025 at 5:30 PM Robi Petersen <robip...@gmail.com> wrote:

> it's odd, the new solr post tool is getting a 404 off the std example of
> posting a web crawl? I can ping the endpoint... and solr is definitely up
> and running, admin page is fine. Am I daft or something? :/
>
> ping http://localhost:8983/solr/gettingstarted/update/extract
>
> PING http://localhost:8983/solr/gettingstarted/update/extract
> (143.244.220.150): 56 data bytes
>
> 64 bytes from 143.244.220.150: icmp_seq=0 ttl=46 time=88.446 ms
>
> 64 bytes from 143.244.220.150: icmp_seq=1 ttl=46 time=87.999 ms
>
> bin/solr post --mode web -c gettingstarted --recursive 1 --delay 1
> https://solr.apache.org/
>
> Posting web pages to Solr url
> http://localhost:8983/solr/gettingstarted/update/extract
>
> Entering auto mode. Indexing pages with content-types corresponding to
> file endings
> xml,json,jsonl,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log
>
> PostTool: WARNING: Never crawl an external web site faster than every 10
> seconds, your IP will probably be blocked
>
> Entering recursive mode, depth=1, delay=1s
>
> Entering crawl at level 0 (1 links total, 1 new)
>
> PostTool: WARNING: Solr returned an error #404 (Not Found) for url:
> http://localhost:8983/solr/gettingstarted/update/extract?literal.id=https%3A%2F%2Fsolr.apache.org&literal.url=https%3A%2F%2Fsolr.apache.org
>
> PostTool: WARNING: Response: <p>
>
>   Searching for Solr?<br/>
>
>   You must type the correct path.<br/>
>
>   Solr will respond.
>
> </p>
>
> PostTool: WARNING: IOException while reading response:
> java.io.FileNotFoundException:
> http://localhost:8983/solr/gettingstarted/update/extract?literal.id=https%3A%2F%2Fsolr.apache.org&literal.url=https%3A%2F%2Fsolr.apache.org
>
>
>
>

Reply via email to