All,

  With the explosion of vision models and methods for creating embeddings
from images (and PDFs!), I thought it might be useful to start a wiki page
that captures some of the techniques currently in use.

   There is such dynamism in the document intelligence/document engineering
space that whatever we write is already out-of-date, but nevertheless, I
started this:
https://cwiki.apache.org/confluence/display/TIKA/Resources+for+Advanced+Document+Processing

  Please edit as you see fit.

          Best,

                  Tim

Reply via email to