GitHub user mtanski opened a pull request:
https://github.com/apache/flink/pull/2002
Support for bz2 compression in flink-core
Add support for bz2 compression to flink. Right now this requires using
Hadoop InputFormats.
Doesn't require any extra dependencies as flink-core already uses the
Apache common-io package. Doesn't support splitting. the current compression
support in Flink would need to be reworked to support that.
It's possible that Flink should use use apache common-io for compression.
This way it would get support support for snappy, bz2, xz, lzma in addition to
gz & deflate without pulling in extra dependencies.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/adfin/flink bz2
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/2002.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2002
----
commit 297aea88f7b5f832898b380e214f9d4080594092
Author: Milosz Tanski <[email protected]>
Date: 2016-05-18T04:04:36Z
Support for bz2 compression in flink-core.
Add support for bz2 compression to flink. Right now this requires using
Hadoop
InputFormats.
Doesn't require any extra dependencies as flink-core already uses the Apache
common-io package.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---