Wiki page for advanced document processing

Tim Allison Thu, 22 Aug 2024 04:39:31 -0700

All,

  With the explosion of vision models and methods for creating embeddings
from images (and PDFs!), I thought it might be useful to start a wiki page
that captures some of the techniques currently in use.


  The document intelligence/document engineering space is moving so fast
that whatever we write will be out of date almost immediately, but
nevertheless, I started this:
https://cwiki.apache.org/confluence/display/TIKA/Resources+for+Advanced+Document+Processing

  Please edit as you see fit.

          Best,

                  Tim
