On Thursday 09 November 2006 19:55, Renaud Waldura wrote:
> I'm thinking I might have to tokenize the document text (I have it),
> then compute the intersection between the set of all terms and the set
> of terms from the rewritten query. Blech. Sounds expensive. Any other
> ideas?
No faster, but
Greetings:
I read the mailing-list archives about this topic and found the PDFBox
solutions at: http://www.pdfbox.org/userguide/highlighting.html
Basically there are 3 options:
1- append query parameters to the PDF URL
2- generate a highlight XML document that Acrobat Reader will download
separa