Hello all,

1st question:
Is there any way to get the count or the content of a Flink DataSet without calling count() or collect()? The problem is that I have a loop whose number of iterations depends on the count of a DataSet, and calling count() may force the whole pipeline to be executed again. I would prefer not to use delta or bulk iterations.

2nd question:

When I apply Aggregations.MAX to the second field of a DataSet of Tuple2<String, Integer>, the second field of the result is the true maximum of the whole dataset, but the first field is not the one that belongs to that maximum. Is this expected?
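
To make the second question concrete, here is a plain-Java sketch of what I think is happening. It does not use Flink at all; the class and method names are made up for illustration. It assumes that a field-wise aggregation copies the non-aggregated field from whichever record it saw last. If I read the DataSet API correctly, maxBy(1) is meant to give the second behaviour, but please correct me if that is wrong.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class MaxVsMaxBy {

    // Field-wise maximum: only the Integer field is aggregated.
    // ASSUMPTION of this sketch: the String is copied from the last record seen.
    static Map.Entry<String, Integer> fieldWiseMax(List<Map.Entry<String, Integer>> rows) {
        String name = null;
        int max = Integer.MIN_VALUE;
        for (Map.Entry<String, Integer> row : rows) {
            name = row.getKey();                 // overwritten on every record
            max = Math.max(max, row.getValue()); // true maximum of the field
        }
        return new SimpleEntry<>(name, max);
    }

    // Record-wise maximum: returns the whole record whose Integer field is largest.
    static Map.Entry<String, Integer> maxBySecond(List<Map.Entry<String, Integer>> rows) {
        Map.Entry<String, Integer> best = null;
        for (Map.Entry<String, Integer> row : rows) {
            if (best == null || row.getValue() > best.getValue()) {
                best = row;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Made-up sample data, not taken from my real job.
        List<Map.Entry<String, Integer>> rows = Arrays.asList(
                new SimpleEntry<>("a", 1),
                new SimpleEntry<>("b", 7),
                new SimpleEntry<>("c", 3));

        System.out.println(fieldWiseMax(rows)); // first field does not match the maximum
        System.out.println(maxBySecond(rows));  // first field matches the maximum
    }
}
```

With this sample data the first call pairs the maximum 7 with "c", while the second returns the original record ("b", 7), which is the result I was expecting.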

Best,
Alieh
