OK I should have realized I need to add a request handler for extract, I guess ping just hits the base url. NM
On Tue, Mar 4, 2025 at 5:30 PM Robi Petersen <robip...@gmail.com> wrote: > it's odd, the new solr post tool is getting a 404 off the std example of > posting a web crawl? I can ping the endpoint... and solr is definitely up > and running, admin page is fine. Am I daft or something? :/ > > ping http://localhost:8983/solr/gettingstarted/update/extract > > PING http://localhost:8983/solr/gettingstarted/update/extract > (143.244.220.150): 56 data bytes > > 64 bytes from 143.244.220.150: icmp_seq=0 ttl=46 time=88.446 ms > > 64 bytes from 143.244.220.150: icmp_seq=1 ttl=46 time=87.999 ms > > bin/solr post --mode web -c gettingstarted --recursive 1 --delay 1 > https://solr.apache.org/ > > Posting web pages to Solr url > http://localhost:8983/solr/gettingstarted/update/extract > > Entering auto mode. Indexing pages with content-types corresponding to > file endings > xml,json,jsonl,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log > > PostTool: WARNING: Never crawl an external web site faster than every 10 > seconds, your IP will probably be blocked > > Entering recursive mode, depth=1, delay=1s > > Entering crawl at level 0 (1 links total, 1 new) > > PostTool: WARNING: Solr returned an error #404 (Not Found) for url: > http://localhost:8983/solr/gettingstarted/update/extract?literal.id=https%3A%2F%2Fsolr.apache.org&literal.url=https%3A%2F%2Fsolr.apache.org > > PostTool: WARNING: Response: <p> > > Searching for Solr?<br/> > > You must type the correct path.<br/> > > Solr will respond. > > </p> > > PostTool: WARNING: IOException while reading response: > java.io.FileNotFoundException: > http://localhost:8983/solr/gettingstarted/update/extract?literal.id=https%3A%2F%2Fsolr.apache.org&literal.url=https%3A%2F%2Fsolr.apache.org > > > >