[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison updated TIKA-1851:
------------------------------
    Attachment: tika_2x_test_files_and_modules.xlsx

I'm attaching the output of a bit of hackery to find which files are used in which modules.

***WARNING: the script pulled file names from anywhere in the Java source code, including comments and unit tests, so there are some false positives (e.g. a unit test in the SQLite parser that ensures that an embedded testWORD_1img.doc is extracted).***

If we assume that tika-parsers would be able to draw all test files from the child modules within tika-parser-modules, there appear to be only 15ish files that are used in more than one child parser module. This might help us figure out what the best arrangement is.

[~kkrugler], yes, I'm happy to move the classes back to tika-core, and I'd slightly prefer to move the test files to their appropriate modules. How do we handle the 15ish files? Are there other items that should cause concern with this move? What are the other options?

[~lewismc], do I understand correctly that you'd be against putting the test files into the specific parser modules?

Thank you, all!

> Tika 2.0 - Move test resources from core to test-resources
> ----------------------------------------------------------
>
>                 Key: TIKA-1851
>                 URL: https://issues.apache.org/jira/browse/TIKA-1851
>             Project: Tika
>          Issue Type: Sub-task
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Trivial
>             Fix For: 2.0
>
>         Attachments: tika_2x_test_files_and_modules.xlsx
>
>
> Let's try to move resources that are used for testing to the test-resources
> module if possible: MockParser, DummyParser, TikaTest and the unit tests for
> MockParser. That should also allow us to drop the test-jar goal in
> tika-core. Anything else?
> Haven't actually tried this yet; there may be surprises.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)