All,
  I was recently chatting about Tika 2.x with some Tika friends and
they had some hesitation about the names for the three high level
parser modules.

They are currently:

tika-parsers-classic
tika-parsers-extended
tika-parsers-advanced

The quibbles weren't with the delineation, but with the naming.

In my mind, this is what I've been thinking as definitions:

tika-parsers-classic -- with the exception of optional OCR, these
should be lightish weight dependencies in pure java with no
parsers/resources that require network calls.

tika-parsers-extended -- these can require native libs and/or have
heavier dependencies, including network calls.

tika-parsers-advanced -- anything goes. dl4j as a dependency, etc.

Some options for classic-> basic, base, ...what else?

Any other recommendations for these names?  Thank you!

Best,

           Tim

Reply via email to