Michael, thank you for bringing this to our attention. I'm reviewing it with
the folks who created the site, and one of us will reply with next steps.


On Sat, Nov 6, 2021 at 10:38 AM Michael Shuler <mich...@pbandjelly.org>
wrote:

> I overwrote the result link - much better, no more 429s.
>
> https://12.am/tmp/cassandra.apache.org_muffet.log.txt
>
> - lots of page anchor problems
> - quite a few busted links
> - quite a few hosts that are gone
> - one link timeout
> - (a few "error" reports are each 200s, just headers)
>
> $ egrep '^\sid #' c*.log.txt |wc -l
> 1416
> $ egrep '^\s4' c*.log.txt |wc -l
> 55
> $ egrep '^\slookup' c*.log.txt |wc -l
> 20
> $ egrep '^\stimeout' c*.log.txt |wc -l
> 1
>
> Warm regards,
> Michael
>
> On 11/6/21 11:59 AM, Michael Shuler wrote:
> > FYI - I'm going to try to slow down the checks, since I just noticed a
> > bunch of the 4xx errors are "HTTP 429 Too Many Requests"
> >
> > Kind regards,
> > Michael
> >
> > On 11/6/21 11:52 AM, Michael Shuler wrote:
> >> (Sending to dev@ which seems a better place to discuss; updated
> >> subject. Thanks OP!)
> >>
> >> I ran a couple of link-checking tools on the site, and there are many
> >> more problems than the two noted. This seems like a good task for a
> >> non-dev who wants to make a substantial impact on the project. Muffet
> >> [0] seemed the quickest way to get some decent output. I grabbed the
> >> v2.4.4 binary release [1]; tar xzvf .., and:
> >>
> >> $ ./muffet https://cassandra.apache.org/ \
> >>   | tee -a cassandra.apache.org_muffet.log.txt
> >>
> >> result (2950 lines):
> >> https://12.am/tmp/cassandra.apache.org_muffet.log.txt
> >>
> >> $ egrep '^\s4' cassandra.apache.org_muffet.log.txt \
> >>   | wc -l
> >> 841
> >> $ egrep '^\sid #' cassandra.apache.org_muffet.log.txt \
> >>   | wc -l
> >> 1401
> >>
> >> [0] https://github.com/raviqqe/muffet
> >> [1] https://github.com/raviqqe/muffet/releases
> >>
> >> Kind regards,
> >> Michael
> >>
> >> On 11/5/21 4:09 PM, Greg Stein wrote:
> >>> see below:
> >>>
> >>> ---------- Forwarded message ---------
> >>> From: Hubert Kulas <hubertzku...@gmail.com>
> >>> Date: Fri, Nov 5, 2021 at 1:29 PM
> >>> Subject: Not working links
> >>> To: <webmas...@apache.org>
> >>>
> >>>
> >>> Hi,
> >>>
> >>> I am writing my thesis about big data and was researching real-world
> >>> use cases of Cassandra. While doing that, I found that clicking
> >>> "read more" under 'Coursera' leads to the DataStax website, where we
> >>> are greeted with a "You do not have access to view this page"
> >>> message. To reproduce it, go to
> >>> https://cassandra.apache.org/_/case-studies.html
> >>> then find Coursera and click "read more".
> >>>
> >>> Then, while trying to find a way to contact you about the problem, I
> >>> encountered another problem on this part of the website:
> >>> https://cassandra.apache.org/doc/3.11.5/contactus.html
> >>> Clicking the icon there leads to
> >>> https://cassandra.apache.org/feed.xml
> >>> which returns a 404 Not Found message.
> >>> 2021-11-05_19h26_44.png
> >>>
> >>> Best Regards,
> >>> Hubert Kulas
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: dev-h...@cassandra.apache.org
>
>
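
For anyone re-running the tallies quoted above, here is a minimal sketch in
POSIX shell. It assumes the log layout implied by the egrep patterns in the
thread (an indented line that starts with a status code, "id #", "lookup",
or "timeout"). The function names count_matches and summarize_log are made
up for illustration and are not part of Muffet. It uses [[:space:]] rather
than \s, which not every grep supports, and grep -c rather than a pipe to
wc -l.

```shell
#!/bin/sh
# Hypothetical helpers for tallying a Muffet log; not part of Muffet itself.

# Print the number of lines in file $2 that match extended regex $1.
count_matches() {
    # grep -c still prints 0 when nothing matches but exits non-zero,
    # so the exit status is deliberately ignored.
    grep -cE -- "$1" "$2" || true
}

# Print one tally per category, using the same line prefixes as the
# egrep commands quoted in the thread.
summarize_log() {
    log=$1
    printf 'anchor problems: %s\n' "$(count_matches '^[[:space:]]id #' "$log")"
    printf '4xx responses:   %s\n' "$(count_matches '^[[:space:]]4' "$log")"
    printf 'lookup failures: %s\n' "$(count_matches '^[[:space:]]lookup' "$log")"
    printf 'timeouts:        %s\n' "$(count_matches '^[[:space:]]timeout' "$log")"
}
```

After defining the functions, something like
"summarize_log cassandra.apache.org_muffet.log.txt" would print the four
counts in one pass.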

-- 
Melissa Logan (she/her)
Principal, Constantia.io
meli...@constantia.io
Cell: 503-317-8498
LinkedIn <https://www.linkedin.com/in/mklogan/> | Twitter
<https://twitter.com/Melissa_B2B>
