Fixed the <nav> issue (HTML <nav> elements pollution of relevancy by existing on all pages.)... serves as an excellent example of how little effort it takes to add a custom processor in a JesterJ project
https://github.com/nsoft/index-solr-ref-guide/issues/1 Now search q=hdfs only matches 7 pages, not all of them :) On Fri, Feb 16, 2024 at 9:50 PM Alexandre Rafalovitch <arafa...@gmail.com> wrote: > Boy, do I remember this "I did a cool thing and nobody looked" feeling for > Solr RefGuide. > > But if it could be useful for this project, my Guide import code is still > public. I actually read the content straight from ASCIIDoc internal > representation as opposed to Tika: > > https://github.com/arafalov/solr-refguide-indexing/blob/master/src/com/solrstart/refguide/Indexer.java > > > Regards, > Alex. > P.s. JesterJ does look interesting, though not related to what I am doing > right now (so not digging deeper). > > > > On Fri, 16 Feb 2024 at 21:12, Gus Heck <gus.h...@gmail.com> wrote: > > > Hi folks, > > > > *TLDR;* I put up a github repo (check it out): > > https://github.com/nsoft/index-solr-ref-guide > > > > *The Details:* > > Last Year I announced JesterJ's 1.0 release and gave a lightning talk > about > > it at Haystack. There were lots of folks who seemed to think it sounded > > cool, but I got zero useful feedback, and no evidence that even one > person > > downloaded and ran it. This was very discouraging, and I wound up > ignoring > > it for a while. > > > > I suspect the problem is that trying it out on a project at work is too > > high stakes, and the majority of folks attending conferences already have > > some sort of working solution. Also coming up with data to index that > isn't > > entirely pointless can be a chore. > > > > Tonight I was thinking a thought I've had many times before: It's kind of > > silly that the Solr ref guide isn't indexed and searchable in a solr > > instance. Currently the ref guide uses a JS based library which has the > > downside of not being able to serve results for past versions. > > > > Then it hit me. This is a great, low stakes way for people to play with > > JesterJ and try it out without having to go search for data. So tonight I > > took 30 min and wrote an ingest to consume Solr Ref Guide (it was that > > easy! almost faster than building the html site) and and now I have a > repo > > that anyone can check out, and fiddle with, that doesn't include the > whole > > JesterJ project. > > > > It's intentionally just a starting point. Please give feedback or even > > PR's. I made a couple of issues noting the most obvious enhancements. > > > > I am aware that there have been attempts to do this in the past, and it's > > non-trivial to do it well, but I hope this can be a fun community effort > > with no time pressure, and opportunities to learn lots of different > things. > > > > Enjoy, > > Gus > > > > -- > > http://www.needhamsoftware.com (work) > > https://a.co/d/b2sZLD9 (my fantasy fiction book) > > > -- http://www.needhamsoftware.com (work) https://a.co/d/b2sZLD9 (my fantasy fiction book)