[ https://issues.apache.org/jira/browse/TIKA-4316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison updated TIKA-4316:
------------------------------
    Description:
I proposed a tentative roadmap here: https://lists.apache.org/thread/9yfzf6qwpc7c6qnlp4tdwsdrnjvv7r1z

Let's use this ticket to discuss some high level changes in 4.x

Some thoughts:

1) Require Java 17
2) Remove tika-batch in favor of tika-pipes with filesystem dependencies
3) Move tika-pipes to a separate module. Consider moving non-trivial implementations of tika-pipes components to a separate project? Consider using pf4j in tika-pipes and other components?
4) Remove unsupported dl4j and sentiment analysis and agepredictor modules and...?
5) Avoid fat jars where possible -- at least move tika-server to a lib/* pattern with the assembly plugin or pf4j instead of the shade plugin
6) Use an auto-correcting linter instead of checkstyle (cosium with google's style format?)
7) Remove the legacy external parser mechanism in favor of the external2 mechanism

  was:
I proposed a tentative roadmap here: https://lists.apache.org/thread/9yfzf6qwpc7c6qnlp4tdwsdrnjvv7r1z

Let's use this ticket to discuss some high level changes in 4.x

Some thoughts:

1) Require Java 17
2) Remove tika-batch in favor of tika-pipes with filesystem dependencies
3) Move tika-pipes to a separate module. Consider moving non-trivial implementations of tika-pipes components to a separate project?
4) Remove unsupported dl4j and sentiment analysis modules and...?
5) Avoid fat jars where possible -- at least move tika-server to a lib/* pattern with the assembly plugin instead of the shade plugin
6) Use an auto-correcting linter instead of checkstyle
7) Remove the legacy external parser mechanism in favor of the external2 mechanism

> Goals for Tika 4.x
> ------------------
>
>                 Key: TIKA-4316
>                 URL: https://issues.apache.org/jira/browse/TIKA-4316
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major