The way I got Swish to index org files was to create a script that generated an xml file (https://kitchingroup.cheme.cmu.edu/blog/2015/07/06/Indexing-headlines-in-org-files-with-swish-e-with-laser-sharp-results/) or html (http://kitchingroup.cheme.cmu.edu/blog/2015/07/03/Using-swish-e-to-index-org-files-as-html/) that it could index. This is probably a general strategy for these tools.
Eric Abrahamsen <e...@ericabrahamsen.net> writes: > Roland Everaert <reveatw...@gmail.com> writes: > >> Hello all, >> >> I am interested in a search/indexing engine targeting the org format, >> too. >> >> My interest comes from the fact that I have a growing number of org >> files and as org-mode has no file archiving feature, AFAIK, searching >> needs more and more time to complete. >> >> Moving files, that are no more necessary, outside of my org-directories, >> can be tedious and prone to moving the wrong file to the wrong location. >> >> Hence, an indexer could comes in handy, especially if it is optimised >> for the Org format (i.e.: it knows what are categories, tags, >> properties, etc in an Org file). > > I think this last point is key. Most full-text search engines provide > config options for defining fields, or "facets", which in theory we > could set up to parse tags/properties/timestamps. My guess is that any > of the major contenders (solr, xapian, lucene) would work pretty much as > well as any of the others -- for our purposes, they probably only differ > in the details. Xapian might be considered "in the family" from a > license standpoint, but I don't know that that matters too much. > > It would be fun to provide an Org indexing config for one of these > engines, and then build the Agenda on top of it. -- Professor John Kitchin Doherty Hall A207F Department of Chemical Engineering Carnegie Mellon University Pittsburgh, PA 15213 412-268-7803 @johnkitchin http://kitchingroup.cheme.cmu.edu