A very, very long time ago (like 5 years ago) I implemented the Justext boilerplate removal algorithm in Go. I found it useful when I was messing around with my own Readability fun project:
https://github.com/JalfResi/justext The code is probably awful (it was one of the projects I used to learn Go) but the algorithm is sound if you want to grab it. On Friday, June 9, 2017 at 2:17:24 PM UTC+1, zhengc...@gmail.com wrote: > > Hello, everyone, > > i'm create an XPath <https://github.com/antchfx/xpath>project in the > past few months,that implemented most of XPath functions and location paths. > > Now XPath package has supported Compile feature that can compile xpath > expression into reused and evaluate values for specified nodes. > > I'm also create a XQuery <https://github.com/antchfx/xquery>project for > XPath project, it's XPath package implemented,that can lets your extract > data from HTML/XML documents using xpath expression. > > oh,yes, i'm also create an interesting project,that can makes web pages > more readable like safari readability,call readability > <https://github.com/antchfx/readability>, it also using these projects. > > I hope these project could help you. :) > > > XPath project: https://github.com/antchfx/xpath > > XQuery project: https://github.com/antchfx/xquery > <https://github.com/antchfx/xquery> > > Readability project: https://github.com/antchfx/readability > > -- You received this message because you are subscribed to the Google Groups "golang-nuts" group. To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.