Re: Is it possible to do state migration with checkpoints?

2020-07-23 Thread Sivaprasanna
Adding dev@ to get some traction. Any help would be greatly appreciated. Thanks. On Thu, Jul 23, 2020 at 11:48 AM Sivaprasanna wrote: > +user...@flink.apache.org > > A follow up question. I tried taking a savepoint but the job failed > immediately. It happens everytime I take a sa

[jira] [Created] (FLINK-18013) Refactor Hadoop Utility classes/functions into a common Maven module

2020-05-28 Thread Sivaprasanna Sethuraman (Jira)
Sivaprasanna Sethuraman created FLINK-18013: --- Summary: Refactor Hadoop Utility classes/functions into a common Maven module Key: FLINK-18013 URL: https://issues.apache.org/jira/browse/FLINK-18013

Re: [DISCUSS] Introduce a new module 'flink-hadoop-utils'

2020-05-28 Thread Sivaprasanna
FYI. I created a Jira to track this improvement. https://issues.apache.org/jira/browse/FLINK-18013 - Sivaprasanna On Thu, May 28, 2020 at 12:22 PM Sivaprasanna wrote: > Awesome. : ) > Thanks, Robert for signing up to be the reviewer. I will create Jira and > share the link here. >

Re: [DISCUSS] Introduce a new module 'flink-hadoop-utils'

2020-05-27 Thread Sivaprasanna
Awesome. : ) Thanks, Robert for signing up to be the reviewer. I will create Jira and share the link here. Stay safe. - Sivaprasanna On Thu, May 28, 2020 at 12:13 PM Robert Metzger wrote: > Hi Sivaprasanna, > > thanks a lot for your proposal. Now that I ran into a HadoopUtils-relate

Re: [PROPOSAL] Google Season of Docs 2020.

2020-05-11 Thread Sivaprasanna
Awesome. Great job. On Mon, 11 May 2020 at 7:22 PM, Seth Wiesman wrote: > Thank you for putting this together Marta! > > On Mon, May 11, 2020 at 8:35 AM Fabian Hueske wrote: > > > Thanks Marta and congratulations! > > > > Am Mo., 11. Mai 2020 um 14:55 Uhr schrieb Robert Metzger < > > rmetz...@a

Re: [DISCUSS] Introduce a new module 'flink-hadoop-utils'

2020-04-30 Thread Sivaprasanna
CompressWriterFactory. CC - Kostas Kloudas since he has a better understanding on the `SerializableHadoopConfiguration.` Cheers, Sivaprasanna On Mon, Mar 30, 2020 at 3:17 PM Chesnay Schepler wrote: > I would recommend to wait until a committer has signed up for reviewing > your changes

Re: [DISCUSS] Adding support for Hadoop 3 and removing flink-shaded-hadoop

2020-04-22 Thread Sivaprasanna
I agree with Aljoscha. Otherwise I can see a lot of tickets getting created saying the application is not running on YARN. Cheers, Sivaprasanna On Wed, Apr 22, 2020 at 1:00 PM Aljoscha Krettek wrote: > +1 to getting rid of flink-shaded-hadoop. But we need to document how > people can now

[jira] [Created] (FLINK-17221) Flink on YARN fails with NoClassDefFoundError: com/sun/jersey/core/util/FeaturesAndProperties

2020-04-17 Thread Sivaprasanna Sethuraman (Jira)
Sivaprasanna Sethuraman created FLINK-17221: --- Summary: Flink on YARN fails with NoClassDefFoundError: com/sun/jersey/core/util/FeaturesAndProperties Key: FLINK-17221 URL: https://issues.apache.org/jira

Are there any link between the flink project.version and flink.shaded.version

2020-04-17 Thread Sivaprasanna
I see quite a few flink.shaded.version in maven repo like 9.0, 10.0, 11.0 being the latest. I just wanted to understand how to decide which flink.shaded.version to use in the user application? Thanks :)

Re: flink-orc or flink-orc-nohive

2020-04-14 Thread Sivaprasanna
I have created a ticket to update the ORC version. https://issues.apache.org/jira/browse/FLINK-17142 On Tue, Apr 14, 2020 at 8:18 PM Jingsong Li wrote: > Hi, yes, we can bump orc-core version to a newer. > > Best, > Jingsong Lee > > On Tue, Apr 14, 2020 at 8:16 PM Siv

[jira] [Created] (FLINK-17142) Bump ORC version

2020-04-14 Thread Sivaprasanna Sethuraman (Jira)
Sivaprasanna Sethuraman created FLINK-17142: --- Summary: Bump ORC version Key: FLINK-17142 URL: https://issues.apache.org/jira/browse/FLINK-17142 Project: Flink Issue Type

Re: flink-orc or flink-orc-nohive

2020-04-14 Thread Sivaprasanna
On a similar note, I just checked that the Flink currently uses orc 1.4.3 in the dependencies. IMO, it is a little outdated. Can we bump the ORC version to a slightly newer version - maybe 1.5.x or even 1.6.0? - Sivaprasanna On Tue, Apr 14, 2020 at 1:42 PM Jingsong Li wrote: > Hi, > &

flink-orc or flink-orc-nohive

2020-04-14 Thread Sivaprasanna
Hello, I'm working on an implementation of ORC BulkWriter[1]. As of now, I have the entire implementation in a separate module called "flink-orc-compress" under "flink-formats" since I'm not entirely sure whether it should go into the existing ORC modules i.e flink-orc & flink-orc-nohive. So my q

Re: [DISCUSS] Introduce a new module 'flink-hadoop-utils'

2020-03-30 Thread Sivaprasanna
Hello Till, I agree with having the scope limited and more concentrated. I can file a Jira and get started with the code changes, as and when someone has some bandwidth, the review can also be done. What do you think? Cheers, Sivaprasanna On Mon, Mar 30, 2020 at 3:00 PM Till Rohrmann wrote

Re: SerializableHadoopConfiguration

2020-03-26 Thread Sivaprasanna
e to improve/edit the content to make it better. Thanks, Sivaprasanna On Tue, Mar 17, 2020 at 8:22 PM Sivaprasanna wrote: > Hi Till, > > Sure. I'll take a look and start a discuss thread soon. > > Thanks, > Sivaprasanna > > On Mon, Mar 16, 2020 at 4:01 PM Till Rohrma

[DISCUSS] Introduce a new module 'flink-hadoop-utils'

2020-03-19 Thread Sivaprasanna
utputFormatBase Can we somehow figure and have them in this new module? Thanks, Sivaprasanna [1] https://lists.apache.org/thread.html/r198f09496ba46885adbcc41fe778a7a34ad1cd685eeae8beb71e6fbb%40%3Cdev.flink.apache.org%3E

Re: SerializableHadoopConfiguration

2020-03-17 Thread Sivaprasanna
Hi Till, Sure. I'll take a look and start a discuss thread soon. Thanks, Sivaprasanna On Mon, Mar 16, 2020 at 4:01 PM Till Rohrmann wrote: > Hi Sivaprasanna, > > do you want to collect the set of Hadoop utility classes which could be > moved to a flink-hadoop-utils module an

Re: Cancel the flink task and restore from checkpoint ,can I change the flink operator's parallelism

2020-03-12 Thread Sivaprasanna
I think you can modify the operator’s parallelism. It is only if you have set maxParallelism, and while restoring from a checkpoint, you shouldn’t modify the maxParallelism. Otherwise, I believe the state will be lost. - Sivaprasanna On Fri, 13 Mar 2020 at 9:01 AM, LakeShen wrote: >

Re: SerializableHadoopConfiguration

2020-03-05 Thread Sivaprasanna
ls with Hadoop based compression. On Fri, Mar 6, 2020 at 1:33 AM João Boto wrote: > We could merge the two modules into one? > sequence-files its another way of compressing files.. > > > On 2020/03/05 13:02:46, Sivaprasanna wrote: > > Hi Stephen, > > > > I guess it

Re: SerializableHadoopConfiguration

2020-03-05 Thread Sivaprasanna
gt; > If yes, does it make sense to create a "flink-hadoop-utils" module with > exactly such classes? It would have an optional dependency on > "flink-shaded-hadoop". > > On Wed, Mar 4, 2020 at 9:12 AM Till Rohrmann wrote: > > > Hi Sivaprasanna, > > &

Re: SerializableHadoopConfiguration

2020-03-03 Thread Sivaprasanna
unity follows? - Sivaprasanna On Tue, Mar 3, 2020 at 10:48 PM Sivaprasanna wrote: > Hi Arvid, > > Thanks for the quick reply. Yes, it actually makes sense to avoid Hadoop > dependencies from getting into Flink's core modules but I also wonder if it > will be an overkill to

Re: SerializableHadoopConfiguration

2020-03-03 Thread Sivaprasanna
Hi Arvid, Thanks for the quick reply. Yes, it actually makes sense to avoid Hadoop dependencies from getting into Flink's core modules but I also wonder if it will be an overkill to add flink-hadoop-fs as a dependency just because we want to use a utility class from that module. - Sivapra

SerializableHadoopConfiguration

2020-03-03 Thread Sivaprasanna
can be used by many other modules, for ex. flink-compress. Thoughts? - Sivaprasanna

Regarding access to Jira

2020-03-02 Thread Sivaprasanna
Hi, Can you please give me access to Flink's Jira board. I created a ticket FLINK-16371 <https://issues.apache.org/jira/browse/FLINK-16371> and want to assign it to myself. Thanks, Sivaprasanna

[jira] [Created] (FLINK-16371) HadoopCompressionBulkWriter fails with 'java.io.NotSerializableException'

2020-03-02 Thread Sivaprasanna Sethuraman (Jira)
Sivaprasanna Sethuraman created FLINK-16371: --- Summary: HadoopCompressionBulkWriter fails with 'java.io.NotSerializableException' Key: FLINK-16371 URL: https://issues.apache.org/jira/browse/F