Thank you. Any thoughts on the ParallelIteratorInputFormat in Flink?
On Thu, Feb 25, 2016 at 12:07 PM, Márton Balassi
wrote:
> Hey Saliya,
>
> I recommend using DataSetUtils.zipWithIndex for this task. [1] It comes
> with flink-java.
>
> [1]
> https://github.com/apache/flink/blob/master/flink-ja
Hey Saliya,
I recommend using DataSetUtils.zipWithIndex for this task. [1] It comes
with flink-java.
[1]
https://github.com/apache/flink/blob/master/flink-java/src/main/java/org/apache/flink/api/java/utils/DataSetUtils.java#L77
On Thu, Feb 25, 2016 at 5:52 PM, Saliya Ekanayake wrote:
> Thank y
Thank you, Marton. That seems doable.
However, is there a way I can create a dummy indexed data set? Like a way
to partition the index range without data across parallel tasks. For
example, if I could have something like,
DataSet ds = ...
then I can implement a custom method to load required dat
Hey Saliya,
I would add a uniqe ID to both the DataSets, the variable you referred to
as 'i'. Then you can join the two DataSets on the field containing 'i' and
do the mapping on the joined result.
Hope this helps,
Marton
On Thu, Feb 25, 2016 at 5:38 PM, Saliya Ekanayake wrote:
> Hi,
>
> I've
Hi,
I've two data sets like,
DataSet a = ...
DataSet b = ...
They have the same type and same decomposition. I want to apply a map
operator that need both *a* and *b. *For example,
a.map( i -> OP)
within this OP I need the corresponding (*i *th) element of *b* as well. Is
there a way to do thi