[jira] [Created] (FLINK-1981) Add GZip support

2015-05-06 Thread Sebastian Kruse (JIRA)
Sebastian Kruse created FLINK-1981:
Summary: Add GZip support
Key: FLINK-1981
URL: https://issues.apache.org/jira/browse/FLINK-1981
Project: Flink
Issue Type: New Feature

Re: Gzip support

2015-05-04 Thread Robert Metzger
> From: Robert Metzger [metrob...@gmail.com] > Sent: Thursday, April 30, 2015 21:01 > To: dev@flink.apache.org > Subject: Re: Gzip support > > There is already support for inflate compressed files and I introduced > logic to handle unsplittable formats.

RE: Gzip support

2015-05-04 Thread Kruse, Sebastian
ticket and a pull request by this week, so that you can have a look at it. Cheers, Sebastian From: Robert Metzger [metrob...@gmail.com] Sent: Thursday, April 30, 2015 21:01 To: dev@flink.apache.org Subject: Re: Gzip support There is already support for

Re: Gzip support

2015-04-30 Thread Robert Metzger
There is already support for inflate compressed files and I introduced logic to handle unsplittable formats. Sent from my iPhone > On 30.04.2015, at 19:39, Stephan Ewen wrote: > > I think that would be very worthwhile :-) Happy to hear that you want to > contribute that! > > Decorating the i

Re: Gzip support

2015-04-30 Thread Stephan Ewen
I think that would be very worthwhile :-) Happy to hear that you want to contribute that! Decorating the input stream sounds like a great approach and would also work for other compression formats. The other thing that needs to be taken into account is that GZIP files are not splittable in the sa
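
The "decorating the input stream" idea mentioned in this message can be sketched roughly as follows. This is only an illustration using the JDK's java.util.zip classes, not the code from the eventual pull request; the class name GzipDecoratorSketch, the helper decorate, and the ".gz" file-name check are assumptions made for the example, and no Flink API is used.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

class GzipDecoratorSketch {

    /**
     * Hypothetical helper: wraps the raw stream in a GZIPInputStream when the
     * file name ends in ".gz", otherwise returns the raw stream unchanged.
     */
    static InputStream decorate(String fileName, InputStream raw) throws IOException {
        if (fileName.endsWith(".gz")) {
            return new GZIPInputStream(raw);
        }
        return raw;
    }

    /** Compresses a string in memory so the example needs no files on disk. */
    static byte[] gzip(String text) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        }
        return buffer.toByteArray();
    }

    /** Reads the whole stream and decodes it as UTF-8. */
    static String readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buffer.write(chunk, 0, n);
        }
        return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        byte[] compressed = gzip("hello\nworld\n");
        // The reader downstream only sees a plain InputStream.
        try (InputStream in = decorate("part-0.gz", new ByteArrayInputStream(compressed))) {
            System.out.print(readAll(in));
        }
    }
}
```

Because a gzip stream has to be decompressed from its beginning, a reader built this way would need to treat each .gz file as a single unit rather than cutting it at arbitrary byte offsets, which is the splittability concern raised above.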

Gzip support

2015-04-30 Thread Kruse, Sebastian
Hi everyone, I just recently came across a use-case where I needed to read gzip files and handle byte order marks transparently. I know that gzip can be read with Hadoop input formats but that did not work for me since I wanted to reuse my existing custom Flink input formats. It turned out tha
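
For the byte order mark part of this use case, one possible way to handle a leading UTF-8 BOM transparently is to peek at the first three bytes and push them back if they are not a BOM. The sketch below is an assumption about how this could look with plain JDK streams; the class name BomSkippingSketch and the helper skipUtf8Bom are made up for the example and are not taken from the message or from Flink.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.PushbackInputStream;

class BomSkippingSketch {

    /**
     * Hypothetical helper: returns a stream positioned after a leading
     * UTF-8 byte order mark (EF BB BF) if one is present, and at the
     * very first byte otherwise.
     */
    static InputStream skipUtf8Bom(InputStream raw) throws IOException {
        PushbackInputStream in = new PushbackInputStream(raw, 3);
        byte[] head = new byte[3];
        int read = 0;
        // read() may return fewer bytes than requested, so loop until 3 or EOF.
        while (read < 3) {
            int n = in.read(head, read, 3 - read);
            if (n == -1) {
                break;
            }
            read += n;
        }
        boolean isBom = read == 3
                && (head[0] & 0xFF) == 0xEF
                && (head[1] & 0xFF) == 0xBB
                && (head[2] & 0xFF) == 0xBF;
        if (!isBom && read > 0) {
            // Not a BOM: put the bytes back so the caller sees them.
            in.unread(head, 0, read);
        }
        return in;
    }

    public static void main(String[] args) throws IOException {
        byte[] data = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF, 'h', 'i'};
        InputStream in = skipUtf8Bom(new ByteArrayInputStream(data));
        System.out.println((char) in.read()); // first byte after the BOM
    }
}
```

Since the helper takes and returns an InputStream, it could be layered on top of a decompressing stream, so that the BOM check runs on the decompressed bytes.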

Re: Flink GZip support

2015-02-22 Thread Robert Metzger
Hi Karim, also have a look at this old discussion from the user@ list: http://apache-flink-incubator-user-mailing-list-archive.2336050.n4.nabble.com/read-gz-files-td760.html On Sun, Feb 22, 2015 at 10:33 AM, Felix Neutatz wrote: > Hi Karim, > > you can use a Hadoop Input Format and read the f

Re: Flink GZip support

2015-02-22 Thread Felix Neutatz
Hi Karim, you can use a Hadoop Input Format and read the files using flink-hadoop-compatibility classes like here: http://flink.apache.org/docs/0.7-incubating/hadoop_compatibility.html Have a nice Sunday, Felix 2015-02-22 10:02 GMT+01:00 Karim Alaa : > Hi All, > > I’m currently working with Fl

Flink GZip support

2015-02-22 Thread Karim Alaa
Hi All, I’m currently working with Flink 0.8.0 and I would like to know if there is or will be any support for handling gzipped files. Thanks!