Fixed the <nav> issue (HTML <nav> elements were polluting relevancy because
they appear on every page). The fix serves as an excellent example of how
little effort it takes to add a custom processor to a JesterJ project:

https://github.com/nsoft/index-solr-ref-guide/issues/1

Now a search for q=hdfs matches only 7 pages, not all of them :)
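
For anyone wondering what stripping <nav> amounts to, here is a minimal
standalone sketch. It is not the code from the repo and does not use
JesterJ's processor API; the class name NavStripper and the regex approach
are assumptions made purely for illustration, and it assumes <nav> elements
are not nested. A real HTML parser would be more robust.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only: removes <nav>...</nav> blocks from raw HTML
// so that navigation text shared by every page does not get indexed.
public class NavStripper {

    // Case-insensitive, and DOTALL so the block may span multiple lines.
    // The reluctant .*? stops at the first closing tag (no nesting support).
    private static final Pattern NAV = Pattern.compile(
            "<nav\\b[^>]*>.*?</nav\\s*>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    public static String stripNav(String html) {
        return NAV.matcher(html).replaceAll("");
    }

    public static void main(String[] args) {
        String page = "<html><body>"
                + "<nav><a href=\"hdfs.html\">HDFS</a></nav>"
                + "<p>Intro to indexing</p>"
                + "</body></html>";
        // prints <html><body><p>Intro to indexing</p></body></html>
        System.out.println(stripNav(page));
    }
}
```

With the navigation gone, a term like "hdfs" that appears only in a sidebar
link no longer makes every page a match.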



On Fri, Feb 16, 2024 at 9:50 PM Alexandre Rafalovitch <arafa...@gmail.com>
wrote:

> Boy, do I remember this "I did a cool thing and nobody looked" feeling for
> Solr RefGuide.
>
> But if it could be useful for this project, my Guide import code is still
> public. I actually read the content straight from the AsciiDoc internal
> representation rather than going through Tika:
>
> https://github.com/arafalov/solr-refguide-indexing/blob/master/src/com/solrstart/refguide/Indexer.java
>
>
> Regards,
>    Alex.
> P.S. JesterJ does look interesting, though not related to what I am doing
> right now (so not digging deeper).
>
>
>
> On Fri, 16 Feb 2024 at 21:12, Gus Heck <gus.h...@gmail.com> wrote:
>
> > Hi folks,
> >
> > *TL;DR:* I put up a GitHub repo (check it out):
> > https://github.com/nsoft/index-solr-ref-guide
> >
> > *The Details:*
> > Last year I announced JesterJ's 1.0 release and gave a lightning talk
> > about it at Haystack. Lots of folks seemed to think it sounded cool,
> > but I got zero useful feedback, and no evidence that even one person
> > downloaded and ran it. This was very discouraging, and I wound up
> > ignoring it for a while.
> >
> > I suspect the problem is that trying it out on a project at work is too
> > high stakes, and the majority of folks attending conferences already have
> > some sort of working solution. Also, coming up with data to index that
> > isn't entirely pointless can be a chore.
> >
> > Tonight I was thinking a thought I've had many times before: It's kind of
> > silly that the Solr ref guide isn't indexed and searchable in a Solr
> > instance. Currently the ref guide uses a JS-based library, which has the
> > downside of not being able to serve results for past versions.
> >
> > Then it hit me. This is a great, low-stakes way for people to play with
> > JesterJ and try it out without having to go search for data. So tonight I
> > took 30 min and wrote an ingest to consume the Solr Ref Guide (it was that
> > easy! almost faster than building the HTML site), and now I have a repo
> > that anyone can check out and fiddle with, that doesn't include the whole
> > JesterJ project.
> >
> > It's intentionally just a starting point. Please give feedback or even
> > PRs. I made a couple of issues noting the most obvious enhancements.
> >
> > I am aware that there have been attempts to do this in the past, and it's
> > non-trivial to do it well, but I hope this can be a fun community effort
> > with no time pressure, and opportunities to learn lots of different
> > things.
> >
> > Enjoy,
> > Gus
> >
> > --
> > http://www.needhamsoftware.com (work)
> > https://a.co/d/b2sZLD9 (my fantasy fiction book)
> >
>


-- 
http://www.needhamsoftware.com (work)
https://a.co/d/b2sZLD9 (my fantasy fiction book)
